Abstract
We consider single-source single-sink (ss-ss) multi-hop relay networks, with slow-fading links and single-antenna
half-duplex relay nodes. While two-hop cooperative relay networks have been studied in great detail in terms of
the diversity-multiplexing tradeoff (DMT), few results are available for more general networks. In this paper, we
identify two families of networks that are multi-hop generalizations of the two-hop network: K-Parallel-Path (KPP)
networks and layered networks.

KPP networks can be viewed as the union of K node-disjoint parallel relaying paths, each of length greater
than one. KPP networks are then generalized to KPP(I) networks, which permit interference between paths, and to
KPP(D) networks, which possess a direct link from source to sink. We characterize the DMT of these families of
networks completely for K > 3. Layered networks are networks comprising layers of relays with edges existing
only between adjacent layers, with more than one relay in each layer. We prove that a linear DMT between the
maximum diversity d_max and the maximum multiplexing gain of 1 is achievable for single-antenna fully-connected
layered networks. This is shown to be equal to the optimal DMT if the number of relaying layers is less than 4.
For multiple-antenna KPP and layered networks, we provide an achievable DMT, which is significantly better than
known lower bounds for half-duplex networks.

For arbitrary multi-terminal wireless networks with multiple source-sink pairs, the maximum achievable diversity
is shown to be equal to the min-cut between the corresponding source and the sink, irrespective of whether the
network has half-duplex or full-duplex relays. For arbitrary ss-ss single-antenna directed acyclic networks with
full-duplex relays, we prove that a linear tradeoff between maximum diversity and maximum multiplexing gain is
achievable.

Along the way, we derive the optimal DMT of a generalized parallel channel and derive lower bounds for the 
DMT of triangular channel matrices, which are useful in DMT computation of various protocols. We also give 
alternative and often simpler proofs of several existing results and show that codes achieving full diversity on a 
MIMO Rayleigh fading channel achieve full diversity on arbitrary fading channels. All protocols in this paper are 
explicit and use only amplify-and-forward (AF) relaying. We also construct codes with short block-lengths based 
on cyclic division algebras that achieve the optimal DMT for all the proposed schemes. 

Two key implications of the results in the paper are that the half-duplex constraint does not entail any rate loss 
for a large class of cooperative networks and that simple AF protocols are often sufficient to attain the optimal 
DMT. 
I. Introduction

A. Prior Work 

The concept of user cooperative diversity was introduced in [1]. Cooperative diversity protocols were first
discussed in [2] for the two-hop relay network (Fig. 1), where the authors develop and analyze the Orthogonal
Amplify and Forward (OAF) protocol and the Selection Decode and Forward (SDF) protocol for the case of a
single relay network.

Zheng and Tse [3] proposed the Diversity-Multiplexing gain Tradeoff (DMT) as a tool to evaluate point-to-point 
multiple-antenna schemes in the context of slow fading channels. The DMT was used as a tool to compare various 
protocols for half duplex two-hop cooperative networks in [4], [5]. As noted in [8], the DMT is a valuable tool 
in the study of cooperative relay networks, because it is simple enough to be analytically tractable and powerful 
enough to compare different protocols. 

In [4], the SDF protocol is analyzed for an arbitrary number of relays, where the authors give upper and lower 
bounds on the DMT of the protocol. In these protocols, the relays and the source node participate for equal time 
instants and the maximum multiplexing gain r that could be achieved was 0.5. 

For any network, an upper bound on the achievable DMT has been given by the cut-set bound [8], [33]. A
fundamental question in this area is whether the two-hop cooperative wireless system in Fig. 1 can mimic a Multiple
Input Single Output (MISO) system with N + 1 transmit antennas and 1 receive antenna and achieve the DMT
corresponding to the MISO system. This question remains open; see [9], [10] for a detailed comparison of
existing achievable regions.




Fig. 1. Two Hop Cooperative Relay Network 

In [5], Azarian et al. analyze the class of Non-Orthogonal Amplify and Forward (NAF) protocols, introduced
earlier by Nabar et al. in [6]. In [5], the authors establish the improved DMT of the NAF protocol in comparison
to the class of OAF protocols considered in [4]. However, it has been shown in [9] that the DMT of the NAF
protocol can be obtained for the OAF protocols as well, using appropriate unequal slot lengths for source and relay
transmissions.

The authors of [5] also introduce the Dynamic Decode and Forward (DDF) protocol, wherein the time for which
the relays listen to the source depends on the source-relay channel gain. They show that for the single relay case,
the DMT of the DDF protocol achieves the transmit diversity bound for r < 0.5, beyond which the DMT falls
below the transmit diversity bound.

Jing and Hassibi [7] consider cooperative communication protocols where the relay nodes apply a linear
transformation to the received signal. The network model that they consider is the same as the one shown in Fig. 1,
except that there is no direct link between source and sink in their model. The authors consider the case when
both the source and the relays transmit for an equal number of channel uses and the linear transformations applied
by the relays are restricted to the class of unitary matrices. Rao and Hassibi [23] consider two-hop half-duplex
multi-antenna cooperative networks without direct link and analyze the DMT performance.

Yang and Belfiore consider a class of protocols called Slotted Amplify And Forward (SAF) protocols in [17],
and show that these improve upon the performance of the NAF protocol [5] for the case of two relays. The authors
also provide an upper bound on the DMT of the SAF protocol with any number of slots, and show that this upper
bound tends towards the transmit diversity bound as the number of slots increases. Under the assumption of relay
isolation and relay ordering, the naive SAF scheme proposed in [17] is shown to achieve the SAF protocol upper
bound.

Yuksel and Erkip in [8] have considered the DMT of the DF and compress-and-forward (CF) protocols. They 
show that the CF protocol achieves the transmit diversity bound for the case of a single relay. We note however, 
that in the CF protocol, the relays are assumed to know all the fading coefficients in the system. The authors also 
translate cut-set upper bounds in [33] for mutual information into the DMT framework for a general multi-terminal 
network. 

Yang and Belfiore in [16] consider AF protocols on a family of MIMO multihop networks (termed
multi-antenna layered networks in the current paper). They derive the optimal DMT for the Rayleigh-product channel,
which they prove is equal to the DMT of the AF protocol applied to this channel. They also propose AF protocols
to achieve the optimal diversity of these multi-antenna layered networks.

Oggier and Hassibi [27] have proposed distributed space time codes for multi-antenna layered networks that 
achieve a diversity equal to the minimum number of relay nodes among the hops. Recently, Vaze and Heath [28] 
have constructed distributed space time codes based on orthogonal designs that achieve the optimal diversity of the 
multi-antenna layered network. 

Borade, Zheng and Gallager in [22] consider AF schemes on a class of multi-hop layered networks where each 
layer has the same number of relays (termed as Regular networks in the current paper). They show that AF strategies 
are optimal in terms of multiplexing gain. They also compute lower bounds on the DMT of the product Rayleigh 
channel. 

From a capacity perspective as well, there have been some investigations into single-source single-sink wireless
networks. Recently, Avestimehr, Diggavi and Tse [26] have evaluated the capacity of deterministic wireless networks
with broadcast and interference constraints. They have also shown that schemes from these deterministic networks
can be lifted to Gaussian networks, to give achievable regions that are within a constant of the outer bounds.
However, it must be noted that they consider only full-duplex networks. The degrees of freedom of arbitrary
full-duplex ss-ss and multicast wireless networks are established in [21] using a connection with deterministic
wireless networks.

From the point of view of code design for multiple antenna systems, Space-Time codes from Cyclic Division
Algebras (CDA) were introduced in [18]. Certain codes constructed from CDAs were proved to be DMT optimal (in fact
approximately universal, see [11]) for the general MIMO channel in [12]. These codes were tailored to suit the
structure of various static protocols for two-hop cooperation and proved to be DMT optimal in [9]. For the Dynamic
Decode and Forward protocol, DMT optimal codes were constructed for an arbitrary number of relays with multiple
antennas in [13]. Recently, in [14], codes for the single relay single antenna DDF channel were constructed, which
are not only DMT optimal, but also have probability of error close to the outage probability. In this paper, we
present a DMT optimal code design for all proposed protocols based on the approximately universal codes in [12].

Cooperative networks with asynchronous transmissions have also been studied in the literature [39], [40], [41]. 
However, we consider networks in which relays are synchronized. Codes for two-hop cooperative networks having 
low decoding complexity and full diversity are studied in [42], [41] and [43]. While decoding complexity is not 
the primary focus of the present paper, we do provide a successive-interference-cancellation technique to reduce 
the code length and therefore the complexity. 

B. Classification of Networks 

In this section, we define the classes of networks under consideration here. Unless otherwise stated, all networks 
considered possess a single source and a single sink and we will apply the abbreviation ss-ss to these networks. 

A cooperative wireless network can be built out of a collection of spatially distributed nodes in many ways.
For instance, we can identify paths connecting the source to the sink through a series of nodes in such a manner that
any two adjacent nodes fall in the Rayleigh zone [8]. This process can be continued, barring those nodes which
are already chosen. Such a construction will result in a set of paths from the source to the sink. In the simplest
model, we can further impose the constraint that these paths do not interfere with each other (see Fig. 2), thus
motivating the study of a class of multi-hop networks which we shall refer to as the set of K-Parallel Path (KPP)
networks.



Fig. 2. Motivation for the KPP networks 



Alternatively, layers of relays can be identified from a collection of nodes between the source and the sink.
This will result in a layered network model, which is described in [22].

1) Representation by a graph: Any wireless network can be associated with a directed graph, with vertices
representing nodes in the network and edges representing connectivity between nodes. If an edge is bidirectional,
we will represent it by two edges, one pointing in each direction. An edge in a directed graph is said to be live
at a particular time instant if the node at the head of the edge is transmitting at that instant. An edge in a directed
graph is said to be active at a particular time instant if the node at the head of the edge is transmitting and the
node at the tail of the edge is receiving at that instant.

Remark 1: Since most networks considered in this paper will have bidirectional links, we will represent a 
bidirectional link by an un-directed edge. Therefore, un-directed edges must be interpreted as two directed edges, 
with one edge pointing in either direction. 

A wireless network is characterized by broadcast and interference constraints. Under the broadcast constraint, 
all edges connected to a transmitting node are simultaneously live and transmit the same information. Under the 
interference constraint, the symbol received by a receiving end is equal to the sum of the symbols transmitted on 
all incoming live edges. We say a protocol avoids interference if only one incoming edge is live for all receiving 
nodes. 

In wireless networks, the relay nodes operate in either half or full-duplex mode. In case of half duplex operation, 
a node cannot simultaneously listen and transmit, i.e., an incoming edge and an outgoing edge of a node cannot 
be simultaneously active. 
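
The live/active terminology and the half-duplex constraint can be made concrete with a small sketch. It is only an illustration under stated assumptions: a directed edge is stored as a pair (transmitting end, receiving end), the node states are passed in as two sets, and all function names are hypothetical rather than taken from the paper.

```python
from typing import Iterable, List, Set, Tuple

# Assumed representation: a directed edge is (transmitting end, receiving end).
Edge = Tuple[str, str]


def bidirectional(links: Iterable[Edge]) -> List[Edge]:
    """Expand each undirected link into two directed edges, one per direction."""
    out: List[Edge] = []
    for u, v in links:
        out.extend([(u, v), (v, u)])
    return out


def live_edges(edges: Iterable[Edge], transmitting: Set[str]) -> List[Edge]:
    """Edges whose transmitting end is currently transmitting (broadcast constraint)."""
    return [(u, v) for u, v in edges if u in transmitting]


def active_edges(edges: Iterable[Edge], transmitting: Set[str],
                 receiving: Set[str]) -> List[Edge]:
    """Live edges whose other end is currently receiving."""
    return [(u, v) for u, v in edges if u in transmitting and v in receiving]


def respects_half_duplex(edges: List[Edge], transmitting: Set[str],
                         receiving: Set[str]) -> bool:
    """True if no node has an incoming and an outgoing active edge at once."""
    active = active_edges(edges, transmitting, receiving)
    senders = {u for u, _ in active}
    listeners = {v for _, v in active}
    return senders.isdisjoint(listeners)


def avoids_interference(edges: List[Edge], transmitting: Set[str],
                        receiving: Set[str]) -> bool:
    """True if every receiving node sees at most one incoming live edge."""
    live = live_edges(edges, transmitting)
    return all(sum(1 for _, v in live if v == node) <= 1 for node in receiving)


# A single relaying path S - R1 - R2 - D with bidirectional links.
path_edges = bidirectional([("S", "R1"), ("R1", "R2"), ("R2", "D")])
half_duplex_ok = respects_half_duplex(path_edges, {"S", "R2"}, {"R1", "D"})
interference_free = avoids_interference(path_edges, {"S", "R2"}, {"R1", "D"})
```

In the example, S and R2 transmit while R1 and D listen. The activation is half-duplex compliant, but R1 hears both S and R2 because of the broadcast constraint, so the activation does not avoid interference.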

2) K-Parallel-Path Networks: One way of generalizing the two-hop relay network is to consider this network 
as a collection of K parallel, relaying paths from the source to sink, each of length > 1. This immediately leads 
to a more general network that is comprised of K parallel paths of varying length, linking source and sink. More 
formally: 

Definition 1: A set of edges $(v_1, v_2), (v_2, v_3), \ldots, (v_{n-1}, v_n)$ connecting the vertices $v_1$ to $v_n$ is called a path.
The length of a path is the number of edges in the path. The K-parallel path (KPP) network is defined as a ss-ss
network that can be expressed as the union of K vertex-disjoint paths, each of length greater than one, connecting
the source to the sink. Each of the node-disjoint paths is called a relaying path. All edges in a KPP network are
bidirectional (see Fig. 3).

The communication between the source and the sink takes place in K parallel paths, labeled with the indices
$P_1, P_2, \ldots, P_K$. Along path $P_i$, the information is transmitted from source to sink through multiple hops with the
aid of $n_i - 1$ intermediate relay nodes $\{R_{ij}\}_{j=1}^{n_i - 1}$.

Remark 2: A network similar to the KPP network in Definition 1 is considered in [37], albeit from a symbol
error probability perspective.

Definition 1 of KPP networks precludes the possibility of either having a direct link between the source and
the sink, or of the existence of links connecting nodes lying on distinct node-disjoint paths. We now expand the
definition of KPP networks to include both possibilities.

Fig. 3. The KPP network

Definition 2: If a given network is a union of a KPP network and a direct link between the source and sink, 
then the network is called a KPP network with direct link, denoted by KPP(D). If a given network is a union 
of a KPP network and links interconnecting relays in various paths, then the network is called a KPP network 
with interference, denoted by KPP(I). If a given network is a union of a KPP network, a direct link and links 
interconnecting relays in various paths, then the network is called a KPP network with interference and direct path, 
denoted by KPP(I, D). 
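
As an illustration of Definitions 1 and 2, the following sketch classifies a small network once a candidate set of backbone paths is supplied. It is not part of the paper: the function name `classify_kpp`, the helper `link`, and the choice to store undirected links as frozensets are assumptions made here, and any extra link that is neither the direct link nor a link between relays on different paths is simply reported as not covered.

```python
from typing import FrozenSet, List, Optional, Set

Link = FrozenSet[str]  # an undirected (bidirectional) link, stored as a 2-element frozenset


def link(u: str, v: str) -> Link:
    return frozenset((u, v))


def classify_kpp(edges: Set[Link], paths: List[List[str]],
                 source: str, sink: str) -> Optional[str]:
    """Return 'KPP', 'KPP(D)', 'KPP(I)' or 'KPP(I, D)', or None if the
    supplied paths are not a valid backbone for the given edge set."""
    backbone: Set[Link] = set()
    owner = {}  # relay node -> index of the backbone path it lies on
    for idx, path in enumerate(paths):
        # Each relaying path runs from source to sink and has length > 1.
        if len(path) < 3 or path[0] != source or path[-1] != sink:
            return None
        for relay in path[1:-1]:
            # Paths must be vertex-disjoint apart from source and sink.
            if relay in owner or relay in (source, sink):
                return None
            owner[relay] = idx
        backbone.update(link(a, b) for a, b in zip(path, path[1:]))
    if not backbone <= edges:
        return None
    has_direct = False
    has_interference = False
    for extra in edges - backbone:
        if len(extra) != 2:
            return None  # self-loop, not meaningful here
        u, v = tuple(extra)
        if extra == link(source, sink):
            has_direct = True
        elif u in owner and v in owner and owner[u] != owner[v]:
            has_interference = True
        else:
            return None  # link type not covered by this sketch
    if has_interference and has_direct:
        return "KPP(I, D)"
    if has_interference:
        return "KPP(I)"
    if has_direct:
        return "KPP(D)"
    return "KPP"
```

For example, with two backbone paths S-A-D and S-B-D, adding the link {A, B} gives a KPP(I) network, and adding {S, D} as well gives a KPP(I, D) network.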

Remark 3: We adopt the following terminology. For a KPP(D), KPP(I) or KPP(I,D) network, we consider the
union of the K node-disjoint paths as the backbone KPP network. When there are several choices for the K
node-disjoint paths, we are free to choose any one set of K node-disjoint paths and refer to this collection of K paths
as the backbone KPP network. The K relaying paths in these networks are referred to as the K backbone paths.
The start node and end node of a backbone path are the first and the last relays, respectively, in the path.

Fig. 4 below provides examples of all four variants of KPP networks.

Fig. 4. Examples of KPP networks with K = 2: (a) a KPP network, (b) a KPP(D) network, (c) a KPP(I) network,
(d) a KPP(I, D) network

In a general KPP network, let $P_i$, $i = 1, 2, \ldots, K$, be the K backbone paths. Let $P_i$ have $n_i$ edges. The j-th edge
on the i-th path $P_i$ will be denoted by $e_{ij}$ and the associated fading coefficient by $g_{ij}$.

3) Layered Network: A second way of generalizing a two-hop relay network is to view the two-hop network
as a network comprising a single layer of relays. The immediate generalization is to allow for more layers of
relays between source and sink, with the proviso that all links are either inside a layer or between adjacent layers.
We label this class of multi-hop relaying networks as layered networks:

Definition 3: Consider a ss-ss single-antenna bidirectional network. A network is said to be a layered network
if there exists a partition of the vertex set V into subsets $V_0, V_1, \ldots, V_L, V_{L+1}$, such that

• $V_0$, $V_{L+1}$ denote the singleton sets corresponding to the source and sink respectively.

• If there is an edge between a node in vertex set $V_i$ and a node in $V_j$, then $|i - j| \le 1$. We assume
$|V_i| > 1$, $i = 1, 2, \ldots, L$.

We call $V_1, \ldots, V_L$ the relaying layers of the network. A layered network is said to be fully connected if, for
any i, $v_1 \in V_i$ and $v_2 \in V_{i+1}$, $(v_1, v_2)$ is an edge in the network.

It must be noted that a fully connected layered network may or may not have links inside of a layer. However, 
whenever we say fully connected layered network, it applies to both networks that have intra-layer links and those 
that do not have such links. Examples of both these types of networks are shown in Fig. 5(c) and Fig. 5(d).
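
The conditions of Definition 3 can be checked mechanically once a candidate partition into layers is given. The sketch below is a minimal illustration, assuming the layers are supplied as a list of sets (source layer first, sink layer last) and the bidirectional links as pairs of node names; the function names are hypothetical.

```python
from itertools import product
from typing import List, Set, Tuple


def is_layered(layers: List[Set[str]], edges: List[Tuple[str, str]]) -> bool:
    """Check the layered-network conditions for a given partition into layers."""
    # First and last layers are the singleton source and sink layers.
    if len(layers) < 3 or len(layers[0]) != 1 or len(layers[-1]) != 1:
        return False
    # Every relaying layer contains more than one relay.
    if any(len(layer) <= 1 for layer in layers[1:-1]):
        return False
    index = {}
    for i, layer in enumerate(layers):
        for node in layer:
            if node in index:
                return False  # not a partition: node appears twice
            index[node] = i
    # Links may only join nodes in the same layer or in adjacent layers.
    for u, v in edges:
        if u not in index or v not in index:
            return False
        if abs(index[u] - index[v]) > 1:
            return False
    return True


def is_fully_connected(layers: List[Set[str]], edges: List[Tuple[str, str]]) -> bool:
    """Layered, and every pair of nodes in adjacent layers is joined by a link."""
    present = {frozenset(e) for e in edges}
    return is_layered(layers, edges) and all(
        frozenset((u, v)) in present
        for lower, upper in zip(layers, layers[1:])
        for u, v in product(lower, upper)
    )
```

Intra-layer links do not affect either check, which matches the convention that a fully connected layered network may or may not have links inside a layer.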




Fig. 5. Examples of layered and regular networks: (a) a layered network with 4 relaying layers, (b) a (3,4) regular
network, (c) a fully connected layered network, (d) a fully connected layered network with intra-layer links



Every layered network will have a layer containing only the source, and another layer containing only the sink.
In Fig. 5, examples of layered networks are given. Layered networks were also considered in [16] and [22]. In
particular, [22] considered layered networks with an equal number of relays on all layers. We refer to such layered
networks as regular networks.

Remark 4: In this remark, we characterize the intersection of KPP(I) networks and layered networks. First we 
observe that one is not contained in the other. Consider the subgraph of a given KPP(I) network graph, consisting 
of all the nodes of the original network except for the source and the sink. This subgraph will have the property 
that the number of node-disjoint and edge-disjoint paths is equal to the number of relay nodes immediately adjacent 
to the source. This is a key property of KPP(I) networks, which in general, does not hold for layered networks. 
On the other hand, there can be cross links between the parallel paths in a KPP(I) network in such a way that the 
network cannot be viewed as being layered. However, these two classes of networks are not mutually exclusive 
and in fact, we term networks that lie in the intersection of the two classes as regular networks. 

Definition 4: The (K,L) Regular network is defined as a KPP(I) network which is also a layered network [16]
with L layers of relays (see Fig. 5(b)).

Remark 5: The two-hop relay network (Fig. 1) is a KPP(I,D) network with K = M, M being the number of
relays. If we assume relay isolation, then it is a KPP(D) network with K = M. If we exclude the direct link, then
we have an (M, 1) regular network.

C. Setting and Channel Model 

Between any two adjacent nodes $v_x$, $v_y$ of a wireless network, we assume the following channel model:

$$y = Hx + w, \tag{1}$$

where y corresponds to the received signal at node $v_y$, w is the noise vector, H is a matrix and x is the vector
transmitted by the node $v_x$.

We follow the literature in making the assumptions listed below. Our description is in terms of the equivalent 
complex-baseband, discrete-time channel. 

1) All channels are assumed to be quasi-static and to experience Rayleigh fading and hence all fade coefficients
are i.i.d., circularly-symmetric complex Gaussian $\mathcal{CN}(0, 1)$ random variables.

2) The additive noise at each receiver is also modeled as possessing an i.i.d., circularly-symmetric complex
Gaussian $\mathcal{CN}(0, 1)$ distribution.

3) Each receiver (but none of the transmitters) is assumed to have perfect channel state information of all the
upstream channels in the network (see footnote 1).

An AF protocol $\wp$, i.e., a protocol $\wp$ in which each node in the network operates in an amplify-and-forward
fashion, induces the following linear channel model between source and sink:

$$y = H(\wp)x + w, \tag{2}$$

where $y \in \mathbb{C}^m$ denotes the signal received at the sink, w is the noise vector, $H(\wp)$ is the (m x n) induced channel
matrix and $x \in \mathbb{C}^n$ is the vector transmitted by the source. The components of the n-tuple x are the n symbols
transmitted by the source and similarly, the components of the m-tuple y represent the symbols received at the
sink. Typically m equals n. We impose the following energy constraint on the transmitted vector x:

$$\mathrm{Tr}(\Sigma_x) := \mathrm{Tr}(\mathbb{E}\{x x^{\dagger}\}) \le n\rho,$$

where Tr denotes the trace operator, and we will regard $\rho$ as representing the SNR on the network. We will assume
a symmetric power constraint on the relays and the source. However, it will turn out that given our high-SNR
perspective here, the exact power constraint is not of significant importance. We consider both half and full-duplex
operation at the relay nodes.

1) Diversity-Multiplexing Gain Tradeoff: Let R denote the rate of communication across the network in bits
per network use. Let $\wp$ denote the protocol used across the network, not necessarily an AF protocol. Let r denote
the multiplexing gain associated to rate R defined by

$$R = r \log(\rho).$$

The probability of outage for the network operating under protocol $\wp$, i.e., the probability of outage of the
induced channel in (2), is then given by

$$P_{out}(\wp, R) = \inf_{\Sigma_x \succeq 0,\ \mathrm{Tr}(\Sigma_x) \le n\rho} \Pr\big(I(x; y) \le nR \mid \mathbf{H}(\wp) = H(\wp)\big).$$

Let the outage exponent $d_{out}(\wp, r)$ be defined by

$$d_{out}(\wp, r) = -\lim_{\rho \to \infty} \frac{\log P_{out}(\wp, R)}{\log(\rho)},$$

and we will indicate this by writing

$$\rho^{-d_{out}(\wp, r)} \doteq P_{out}(\wp, R).$$

The symbols $\dot{\ge}$, $\dot{\le}$ are similarly defined.

The outage exponent $d_{out}(r)$ of the network associated to multiplexing gain r is then defined as the supremum of
the outage exponents taken over all possible protocols, i.e.,

$$d_{out}(r) = \sup_{\wp} d_{out}(\wp, r).$$
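
To make the definition of the outage exponent concrete, the following sketch evaluates it numerically for the simplest possible case, a single Rayleigh-fading link with one transmit and one receive antenna. This special case is chosen here purely for illustration and is not one of the networks treated in the paper; the function names are hypothetical. For this link the outage probability has a closed form, and the normalized exponent approaches 1 - r as the SNR grows, in agreement with the known point-to-point result of [3].

```python
import math


def outage_probability(snr: float, r: float) -> float:
    """Outage probability of a scalar Rayleigh link at rate R = r * log(snr).

    Outage occurs when log(1 + snr * |h|^2) < r * log(snr), i.e. when
    |h|^2 < (snr**r - 1) / snr, and |h|^2 is exponential with unit mean.
    """
    threshold = (snr ** r - 1.0) / snr
    return -math.expm1(-threshold)  # 1 - exp(-threshold), computed stably


def outage_exponent_estimate(snr: float, r: float) -> float:
    """Finite-SNR estimate of -log(P_out) / log(snr); requires 0 < r < 1."""
    if not 0.0 < r < 1.0:
        raise ValueError("this sketch assumes 0 < r < 1")
    return -math.log(outage_probability(snr, r)) / math.log(snr)


# The estimates approach 1 - r as the SNR grows.
estimates = {r: outage_exponent_estimate(1e12, r) for r in (0.25, 0.5, 0.75)}
```

The limit in the definition is only approached, never reached, at finite SNR, so the values in `estimates` are close to, but not exactly, 0.75, 0.5 and 0.25.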

Footnote 1: However, for the protocols proposed in this paper, the CSIR is utilized only at the sink, since all the
relay nodes are required to simply amplify and forward the received signal.



A distributed space-time code (more simply a code) operating under a protocol $\wp$ is said to achieve a diversity
gain $d(\wp, r)$ if

$$P_e(\wp, \rho) \doteq \rho^{-d(\wp, r)},$$

where $P_e(\wp, \rho)$ is the average error probability of the code $\mathcal{C}(\rho)$ under maximum likelihood decoding. Using Fano's
inequality, it can be shown (see [3]) that for a given protocol,

$$d(\wp, r) \le d_{out}(\wp, r).$$

We will refer to the outage exponent $d_{out}(r)$ as the DMT $d(r)$ of the corresponding channel since for every
protocol discussed in this paper we shall identify a corresponding coding strategy in Section IX-A whose diversity
gain $d(\wp, r)$ equals $d_{out}(r)$.

For each of the networks described in this paper, we can get an upper bound on the DMT, based on the cut-set 
upper bound on mutual information [33]. This was formalized in [8] as follows: 

Lemma 1.1: Given a cut $C_i$, $i = 1, 2, \ldots, M$, between any source and sink, let $r^{(C_i)} \log(\rho)$ be the rate of
information flow across the cut. Given a cut, there is an H matrix connecting the input terminals of the cut to the
output terminals. Let us call the DMT corresponding to this H matrix the DMT of the cut, $d_{C_i}(r^{(C_i)})$. Then the
DMT between the source and the sink is upper bounded by

$$d(r) \le \min_i \{ d_{C_i}(r^{(C_i)}) \}.$$
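
The bound in Lemma 1.1 is a pointwise minimum over cuts, which the sketch below illustrates. It rests on two simplifying assumptions that are not part of the lemma: every cut carries the same multiplexing gain r (as in a ss-ss network), and the matrix of each cut is treated as an i.i.d. Rayleigh matrix, so that its DMT is the piecewise-linear point-to-point curve of [3]. Cut matrices in a general network need not have this form, and the function names are hypothetical.

```python
import math
from typing import List, Tuple


def mimo_dmt(m: int, n: int, r: float) -> float:
    """DMT of an m x n i.i.d. Rayleigh channel: the piecewise-linear curve
    through the points (k, (m - k)(n - k)) for k = 0, ..., min(m, n)."""
    if r < 0:
        raise ValueError("multiplexing gain must be non-negative")
    if r >= min(m, n):
        return 0.0
    k = int(math.floor(r))
    left = (m - k) * (n - k)
    right = (m - k - 1) * (n - k - 1)
    return left + (r - k) * (right - left)


def cutset_bound(cuts: List[Tuple[int, int]], r: float) -> float:
    """Upper bound on d(r): minimum of the cut DMTs, all evaluated at the same r."""
    return min(mimo_dmt(m, n, r) for m, n in cuts)


# Two-hop network with N = 2 relays and a direct link: the cut around the
# source is a 1 x 3 channel and the cut around the sink is a 3 x 1 channel.
bound_at_half = cutset_bound([(1, 3), (3, 1)], 0.5)
```

For this two-relay example the bound evaluates to 3(1 - r), the transmit diversity bound of a MISO system with N + 1 = 3 transmit antennas.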

Definition 5: Given a random matrix H of size m x n, we define the DMT of the matrix H as the DMT of
the associated channel y = Hx + w, where y is an m-length received column vector, x is an n-length transmitted
column vector and w is a column vector with i.i.d. $\mathcal{CN}(0, 1)$ entries. We denote the DMT by $d_H(\cdot)$.

D. Results 

The principal results of this paper are tabulated in Table I. Some of these results were presented in conference 
versions of this paper [19], [20]. We have characterized achievable DMT/diversity for many classes of networks as 
given in the table. When compared against the cut-set upper bound, in many cases, the optimal DMT is achieved. 
In other cases, we prove that a linear DMT between the maximum multiplexing gain and maximum diversity is
achievable, while the cut-set upper bound can be concave in general. Explicit schemes and code designs are
established for all the achievable DMTs. In the table, M refers to the min-cut of the network of interest.
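
For small examples, the min-cut M can be computed with a standard max-flow routine. The sketch below uses augmenting paths found by breadth-first search and makes one assumption that should be stated loudly: every single-antenna link contributes one unit to a cut, and a bidirectional link is entered as two directed edges. The function names are hypothetical and the routine is a generic textbook method, not a procedure from the paper.

```python
from collections import defaultdict, deque
from typing import Dict, Iterable, List, Tuple


def bidirectional(links: Iterable[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Expand each undirected link into two directed edges."""
    out: List[Tuple[str, str]] = []
    for u, v in links:
        out.extend([(u, v), (v, u)])
    return out


def min_cut(edges: Iterable[Tuple[str, str]], source: str, sink: str) -> int:
    """Max-flow (equal to min-cut) with unit capacity on every directed edge."""
    cap: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))
    for u, v in edges:
        cap[u][v] += 1
        cap[v][u] += 0  # make sure the residual arc exists
    flow = 0
    while True:
        # Breadth-first search for an augmenting path in the residual graph.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, c in cap[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return flow
        # Push one unit of flow along the path that was found.
        v = sink
        while parent[v] is not None:
            u = parent[v]
            cap[u][v] -= 1
            cap[v][u] += 1
            v = u
        flow += 1


# A KPP network with K = 2: paths S-A1-A2-D and S-B1-D.
kpp_links = [("S", "A1"), ("A1", "A2"), ("A2", "D"), ("S", "B1"), ("B1", "D")]
m_kpp = min_cut(bidirectional(kpp_links), "S", "D")
m_kpp_d = min_cut(bidirectional(kpp_links + [("S", "D")]), "S", "D")
```

Under the stated assumption, the two-path KPP example has min-cut 2, and adding a direct link raises it to 3, in line with the K and K + 1 entries at r = 0 in the KPP and KPP(D) rows of Table I.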

For arbitrary cooperative networks with multiple sources and sinks, each potentially equipped with multiple
antennas, we characterize the maximum achievable diversity gain and give a scheme that achieves this maximum
diversity using an amplify-and-forward protocol in Section IV-A. For arbitrary ss-ss networks with full-duplex
operation, we prove that a linear tradeoff between maximum diversity and maximum multiplexing gain is achievable
using an amplify-and-forward protocol in Section IV.

For both KPP and layered networks, we propose an explicit protocol that achieves a diversity-multiplexing
tradeoff that is linear between the maximum diversity and maximum multiplexing gain points in Section VI. For KPP
networks, this coincides with the upper bound on the DMT as given by the cut-set bound, thus characterizing
the DMT of this entire family of networks completely. For layered networks, the cut-set bound turns out to be
concave in the general case and does not coincide with the achievable region. For general layered networks, we
give a sufficient condition for the achievability of a linear DMT between the maximum diversity and the maximum
multiplexing gain in Lemma 7.3.

Along the way, we derive the optimal DMT of the parallel channel in Lemma 3.5, provide alternative and often
simpler proofs of several existing results and, in Section IX-B, prove that codes achieving full diversity on a MIMO
Rayleigh fading channel achieve full diversity on arbitrary fading channels.

In Section IX-A, we give explicit codes with short block-lengths based on cyclic division algebras that achieve
the best possible DMT for all the schemes proposed above. We also prove (Section IX-B) that full diversity codes
for all networks in this paper can be obtained by using codes that give full diversity on a Rayleigh fading MIMO
channel.

For KPP and layered networks with multiple antenna nodes, we examine certain protocols and establish achievable
DMT for these protocols in Section VIII.
TABLE I

Principal Results Summary

Network | No. of sources/sinks | No. of antennas in nodes | FD/HD | Direct link | Upper bound on diversity/DMT, d_bound(r) | Achievable diversity/DMT, d_achieved(r) | Is upper bound achieved? | Reference
Arbitrary | Multiple | Multiple | FD/HD | Yes | d(0) = M | d(0) = M | Yes (d_max achieved) | Theorem 4.1
Arbitrary | Multiple | Multiple | FD/HD | No | d(0) = M | d(0) = M | Yes (d_max achieved) | Theorem 4.1
Arbitrary Directed Acyclic Networks | Single | Single | FD | Yes | Concave in general | M(1 - r)^+ | A linear DMT between d_max and r_max is achieved | Theorem 4.2
KPP (K > 3) | Single | Single | HD | No | K(1 - r)^+ | K(1 - r)^+ | Yes | Theorem 5.10
KPP(D) (K > 3) | Single | Single | HD | Yes | (K + 1)(1 - r)^+ | (K + 1)(1 - r)^+ | Yes | Theorem 5.11
KPP(I) (K > 3) | Single | Single | HD | No | K(1 - r)^+ | K(1 - r)^+ | Yes | Theorem 6.7
Fully Connected Layered | Single | Single | HD | No | Concave in general | M(1 - r)^+ | A linear DMT between d_max and r_max is achieved. Yes for L < 4 | Theorem 7.5, Corollary 7.6
General Layered (satisfying Lemma 7.3) | Single | Single | HD | No | Concave in general | M(1 - r)^+ | A linear DMT between d_max and r_max is achieved | Lemma 7.3
(K, L) Regular | Single | Single | HD | No | K(1 - r)^+ | K(1 - r)^+ | Yes | Theorem 6.3


II. Relation to Existing Literature 

In this section, we present how the results in this paper relate to other work in this area. Certain results in this
paper can be used to recover existing results on cooperative communication in a simpler, more concise and more
intuitive manner.

1) Proof of Conjecture 1 in the paper by Rao and Hassibi [23] and [24]: 

The general NAF protocol considered in Example 3 in Section III-E of the present paper is the same as that
considered by Rao and Hassibi. The results here prove Conjecture 1 given in [23] and [24].

2) The lower bound on the DMT of various AF Protocols: We prove lower bounds on the DMT of various AF 
protocols. While most are previously known, the new method employed here presents a simpler derivation. 
As it turns out, all lower bounds for single antenna systems provided here are tight. 

NAF Protocol: The DMT of the NAF protocol was computed in [5]. We prove a lower bound on the DMT 
which turns out to be tight. 

SAF Protocol: The Slotted Amplify and Forward protocol is proposed in [17], and upper and lower bounds on
its DMT under relay isolation are evaluated and shown to be equal. For doing so, matrix theoretic techniques
are employed in [17]. In the current paper, in Example 2 of Section III-E, the lower bound for the same is
developed using information theoretic techniques, which lends insight into the form of the DMT.

N-Relay MIMO NAF Channel given in [15]:

In [15], the authors consider a two-hop relay network with a direct link and N relays. We prove an
improved lower bound on the DMT for the MIMO NAF protocol considered in that paper (see Example
4 of Section III-E).

3) The diversity of arbitrary cooperative networks. 

We characterize completely the maximum diversity order attainable for arbitrary cooperative networks and 
it is shown that an amplify and forward scheme is sufficient to achieve this. Special cases of these were 
derived for the MIMO two-hop relay channel in [15], under a certain condition on the number of antennas 
(see Corollary 1 in that paper). Also, the diversity order of layered networks using amplify and forward
protocols is characterized in [16]. In [38], upper bounds on the diversity order of an arbitrary single-source
single-sink network under the two cases of common and independent code-books were derived. However, no
achievability results are given there.

4) The optimal DMT of the two-hop cooperative channel without direct link. 

The optimal DMT of a (K,L) regular network is derived in Theorem 6.3 in Section VI of this paper. In
an independent (parallel) work by Gharan, Bayesteh and Khandani [25], the optimal DMT of a two-hop
network, which is a special case of a regular network (in particular, it is a (K,1) network), is derived to be
d(r) = K(1 - r). The protocol they propose is the same as the protocol employed in the present paper. In fact,
both these protocols are simply the SAF (Slotted Amplify and Forward) protocol [17] applied in the situation
when there is no direct link between source and sink. It must be noted, however, that the proof techniques
used in this paper are entirely different from those used in [25].

5) The DMT of the parallel channel in closed form is obtained in Lemma 3.5. A special case of this result is 
derived in [16], where the authors characterize the parallel channel DMT when all the individual channels 
have the same DMT. 

6) For arbitrary full-duplex networks, it is shown in the present paper that a linear DMT between the 
maximum diversity and the maximum multiplexing gain is achievable. A special case of this result is proved 
for the case of layered networks in [16]. 

A. Outline 

In Section III, we present techniques and general results which will be of use in later sections. In that section, 
we introduce the information-flow diagram (i-f diagram) and prove the result that min-cut equals diversity. In 
Section IV, we consider the case with full-duplex relays and present schemes achieving the optimal DMT for KPP(I,D) 
networks. In Section V, we focus on half-duplex KPP networks and present protocols achieving the optimal DMT for 
K > 3. In Section VI, KPP(I) networks with half-duplex relays are considered, and schemes achieving the optimal 
DMT are presented for KPP(I) networks allowing certain types of interference. In Section VII, we consider layered 
networks and show that a linear DMT between a maximum multiplexing gain of 1 and a diversity of d_max is obtained, which is 
indeed optimal if the number of layers is less than 4. In Section VIII-A, we consider multi-antenna layered and 
KPP networks and give an achievable DMT, which improves significantly on known bounds. Finally, in Section 
IX-A, we give explicit CDA-based codes of low complexity for all the DMT-optimal protocols. 

III. Techniques and General Results for Cooperative Networks 
A. Amplify and Forward Protocols 

We consider only amplify-and-forward (AF) protocols in this paper by which we mean that relays are allowed to 
perform only linear processing on their received signals prior to transmission. In particular, they are not permitted 
to decode and then re-encode. 

In all of our protocols, we assume that the relays perform the simplest form of linear processing: transmission 
upon scaling the incoming signal by an appropriate constant to meet a transmit-power constraint.² Furthermore, it is 
known [5] that this constant does not matter in the scale of interest. Therefore, without loss of accuracy, we will 
assume that this constant is indeed 1. 

² More sophisticated linear processing techniques would include matrix transformations of the incoming signal. 

It follows that, for any given network, we only need specify the schedule to completely specify the protocol. 
Once the schedule is specified, each node transmits the last received signal in the next time instant in accordance 
with the schedule. This will create a transfer matrix between the signal transmitted from the source and the sink, 
with the noise being no longer white. To compute the DMT offered by the protocol, we need to compute the DMT 
of the equivalent channel y = Hx + w, where H is the effective transfer matrix and w is the noise vector, which 
is potentially colored. 

In this section, we will develop techniques to handle non-white noise and a general method to compute lower 
bounds on the DMT of matrices with certain structure. 



B. The Information Flow Diagram 

We begin by introducing the notion of an information-flow (i-f) diagram as a means of characterizing the 
mutual information between the source and the sink in an ss-ss relay network. An ss-ss relay network will have many 
paths between the source and the sink, including a direct link. Protocols employed in a wireless network need to take 
into account the half-duplex, interference and broadcast constraints at each of the nodes. Due to the complexity of 
the network graph, it is in general difficult to characterize the network information-theoretically under the wireless 
constraints. The i-f diagram that we propose is an attempt to abstract out the details of the network graph and to 
focus our attention only on the mutual information between source and sink, given a protocol. 

As will be seen, the i-f diagram is well suited to studying amplify and forward relay networks. 




Fig. 6. Single Relay Channel 



Example 1: Consider an ss-ss, single-relay scenario, operating under the Non-orthogonal Amplify and Forward 
(NAF) protocol of [5] (Fig. 6). This is a two-slot protocol, wherein during the first slot, the source transmits to both 
relay and sink. During the second slot, the relay re-transmits the information that it received during the first time 
slot, while the source transmits new information at this time. Let us represent the random vectors associated with the 
source transmissions in time slots one and two by $x_1, x_2$ and the corresponding data received by the sink in the two 
time slots by $y_1, y_2$. 

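Then the input-output relation takes on the following form:

\[ y = Hx + n, \tag{3} \]

where

\[
y = \begin{bmatrix} y_1 \\ y_2 \end{bmatrix}, \quad
H = \begin{bmatrix} g_1 & 0 \\ g_2 h_2 & g_1 \end{bmatrix}, \quad
x = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix}, \quad
n = \begin{bmatrix} w_1 \\ h_2 v + w_2 \end{bmatrix}.
\]

Given that H is known, the covariance matrices of the noise and signal vectors are denoted by

\[
\Sigma_n := E(nn^\dagger) = \begin{bmatrix} \sigma_w^2 & 0 \\ 0 & \sigma_w^2 + |h_2|^2 \sigma_v^2 \end{bmatrix}
\quad \text{and} \quad
\Sigma_x := E(xx^\dagger),
\]

where $\sigma_v^2, \sigma_w^2$ denote the variances of the corresponding noise vectors. We will assume $\sigma_v^2 = \sigma_w^2 = 1$ without loss 
of generality, since the exact value does not matter in the scale of interest. 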
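As a minimal sketch of relation (3), the following Python snippet assembles $H$ and $\Sigma_n$ for one random draw of the fading coefficients. The helper names `cn01` and `naf_channel` are hypothetical (they do not appear in the paper), unit noise variances are assumed as above, and the reading of $g_1$ as the direct link and of $g_2$, $h_2$ as the source-relay and relay-sink links is inferred from the structure of $H$ and $n$.

```python
import random


def cn01(rng):
    """Draw one circularly symmetric complex Gaussian CN(0, 1) sample."""
    s = 0.5 ** 0.5  # each real dimension has variance 1/2
    return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))


def naf_channel(g1, g2, h2):
    """Return (H, Sigma_n) for the two-slot NAF relation y = Hx + n.

    H is lower triangular: g1 on the diagonal and the product g2*h2
    below the diagonal. Sigma_n is the diagonal noise covariance
    under the assumption of unit noise variances.
    """
    H = [[g1, 0j], [g2 * h2, g1]]
    sigma_n = [[1.0, 0.0], [0.0, 1.0 + abs(h2) ** 2]]
    return H, sigma_n


rng = random.Random(0)  # fixed seed so the draw is reproducible
g1, g2, h2 = cn01(rng), cn01(rng), cn01(rng)
H, sigma_n = naf_channel(g1, g2, h2)
```

The second diagonal entry of `sigma_n` is never smaller than the first, which is the sense in which the noise is colored by the relay path.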
We represent the induced channel by the i-f diagram in Fig. 7. 



Fig. 7. Information flow diagram of single relay channel 



In the i-f diagram in Fig. 7, we have used the subscript s to denote straight coupling and the subscript c to denote 
cross coupling, so that the following equivalences hold:

\[ H_s = g_1, \qquad H_c = g_2 h_2, \]

with the noise in the second slot having variance $1 + |h_2|^2$. 



The interpretation of the arrows in the i-f diagram is illustrated in Fig. 8 and Fig. 9. 

Fig. 8. Equivalent channel model of a single link in i-f diagram. (a) A single link in i-f diagram, labeled $\{H, \Sigma\}$. (b) Equivalent channel model, with additive noise $z$ and $\Sigma = E(zz^\dagger)$. 

Fig. 9. Equivalent channel model of multiple-access links in i-f diagram. (a) Multiple access links in i-f diagram. (b) Equivalent channel model, with $\Sigma_k = E((z_k + z_0)(z_k + z_0)^\dagger)$. 

The two-tuple notation $(H, \Sigma)$ for each link is used to specify the channel matrix for the signal and the noise 
covariance matrix. $H$ is the channel matrix, and therefore the transmitted signal $x$ is multiplied by $H$ to give $Hx$ 
at the receiver. The noise is potentially correlated because it is accumulated over multiple links, in such a way that 
the noise added on one link gets multiplied by the channel matrix of the next link. 

The input-output relation for the single link in the i-f diagram of Fig. 8 is explained as follows: 

\[ y_1 = H x_1 + z, \]

where $z$ is a complex Gaussian random vector with $\Sigma = E(zz^\dagger)$. 

The input-output relation for the multiple links terminating in a given node in the i-f diagram of Fig. 9 is explained 
as follows: 

\[ y = \sum_{i=1}^{N} H_i x_i + \sum_{i=1}^{N} z_i + z_0, \]

where $z_k$ and $z_0$ are independent complex Gaussian random vectors and $\Sigma_k = E((z_k + z_0)(z_k + z_0)^\dagger)$. 

C. White in the scale of interest 

In this section, we provide two lemmas that will be used extensively in all later sections: Lemma 3.1, which 
states that noise, even though correlated, can be treated as white in the scale of interest, and Lemma 3.2, which 
proves that i.i.d. Gaussian inputs are sufficient to attain the outage exponent of any channel of the form y = Hx + w. 

Lemma 3.1: Consider a channel of the form $y = Hx + z$. Let $H, F_j$, $j = 1, 2, \ldots, L$ be $n \times n$ independent 
random matrices, with the entries of each matrix being i.i.d. random variables with complex Gaussian $\mathcal{CN}(0, 1)$ 
distribution. Let $G_i$, $i = 1, 2, \ldots, M$ each be a finite product of matrices from the set $\{F_j\}$. Let 
$z = z_0 + \sum_{i=1}^{M} G_i z_i$, where $\{z_i\}$ are i.i.d. circularly symmetric $n$-dimensional complex Gaussian $\mathcal{CN}(0, I)$ random vectors. 

Then z is white in the scale of interest, i.e., 

1) $\lambda_i \doteq \rho^0 \ \forall i$ with probability one, where $\lambda_i$ are the eigenvalues of the noise covariance matrix $\Sigma$. 

2) $\log\det(I + \rho HH^\dagger\Sigma^{-1}) \doteq \log\det(I + \rho HH^\dagger)$ with probability one. 

3) $\Pr(\log\det(I + \rho HH^\dagger\Sigma^{-1}) < r\log\rho) \doteq \Pr(\log\det(I + \rho HH^\dagger) < r\log\rho)$. 
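
As a quick numerical illustration of the second assertion, consider the simplest scalar setting ($n = 1$ with a single relayed noise term, so that $\Sigma = 1 + |g|^2$). The function name `exponent_gap` and the sample values below are assumptions made only for illustration; this is a sanity check, not a part of the proof.

```python
import math


def exponent_gap(h_abs2, g_abs2, rho):
    """Normalized gap between the white-noise and colored-noise log-det terms.

    Scalar case with noise covariance Sigma = 1 + |g|^2. Returns
    [log(1 + rho*|h|^2) - log(1 + rho*|h|^2 / Sigma)] / log(rho).
    """
    sigma = 1.0 + g_abs2
    white = math.log(1.0 + rho * h_abs2)
    colored = math.log(1.0 + rho * h_abs2 / sigma)
    return (white - colored) / math.log(rho)


# One fixed channel realization, evaluated at increasing SNR.
gaps = [exponent_gap(0.7, 2.5, 10.0 ** k) for k in (1, 3, 6, 12)]
for k, gap in zip((1, 3, 6, 12), gaps):
    print("rho = 1e%d: gap = %.4f" % (k, gap))
```

The gap is bounded by $\log\Sigma / \log\rho$ and therefore shrinks as $\rho$ grows, which is what equality in the scale of interest expresses for a fixed channel realization.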

Proof: For a fixed set of values of $F_j$ and $H$, the noise covariance matrix is given by 

\[ \Sigma = E[zz^\dagger] = I + \sum_{i=1}^{M} G_i G_i^\dagger. \tag{4} \]

Let $\lambda_i(A)$, $\lambda_{\max}(A)$ and $\lambda_{\min}(A)$ denote the $i$th, maximum and minimum eigenvalues of the positive semi-definite 
matrix $A$. If the context is clear, we may avoid specifying the matrix and just use $\lambda_i$, $\lambda_{\max}$ and $\lambda_{\min}$ 
respectively. 

By Theorem 6.1.1 in [34], due to Gersgorin, each eigenvalue of $\Sigma$, when properly ordered, is bounded within 
the interval 

\[ \Sigma_{ii} - R_i(\Sigma) \le \lambda_i(\Sigma) \le \Sigma_{ii} + R_i(\Sigma), \quad \text{where} \quad R_i(\Sigma) := \sum_{j=1,\, j \ne i}^{n} |\Sigma_{ij}|. \tag{5} \]

For $i = 1, 2, \ldots, M$, let $G_i$ be a product of $n_i$ matrices from the set $\{F_j : j = 1, 2, \ldots, L\}$, and let them be 
labeled as $F_{ij}$, $j = 1, 2, \ldots, n_i$. Let $F_{ij}(k, l)$ denote the $(k, l)$th entry of the matrix $F_{ij}$. Note that each $F_{ij}(k, l) \sim \mathcal{CN}(0, 1)$. 
Also, let $G_i(k, l)$ denote the $(k, l)$th entry of $G_i$. Then, 

\[ \Sigma_{ii} = 1 + \sum_{\ell} \sum_{k} |G_\ell(i,k)|^2, \tag{6} \]

\[ |\Sigma_{ij}| = \Big| \sum_{\ell} \sum_{k} G_\ell(i,k)\, G_\ell^\dagger(k,j) \Big| \le \sum_{\ell} \sum_{k} |G_\ell(i,k)|\, |G_\ell(j,k)|, \quad i \ne j. \tag{7} \]

Now every $G_\ell(i,j)$ is a polynomial function of the $\mathcal{CN}(0, 1)$ entries of $F_{\ell m}$, $m = 1, 2, \ldots, n_\ell$. Define a random 
variable $v$ such that $|G_\ell(i,j)|^2 = \rho^{-v}$. We will now prove that $v \ge 0$ with probability one, for every $\ell$, $i$ and 
$j$. Let $v$ also denote a realization of this random variable. It can be proved that polynomial functions of independent 
random variables that have finite mean and variance have finite mean and variance. Therefore $E(|G_\ell(i,j)|^2)$ is 
finite. 

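Let the pdf of $v$ be $p_v(v)$. We have to prove that $P(v < 0) = 0$. Suppose we have proved that $P(v < -\frac{1}{n}) = 0$ 
for all $n \in \mathbb{N}$; then we have: 

\[ P(v < 0) = P\Big( \bigcup_{n=1}^{\infty} \{ v < -\tfrac{1}{n} \} \Big) \le \sum_{n=1}^{\infty} P(v < -\tfrac{1}{n}) = \sum_{n=1}^{\infty} 0 = 0. \]

Now we will prove that indeed $P(v < -\frac{1}{n}) = 0 \ \forall n$. For any given $n$ and $\rho$, 

\begin{align*}
\infty > E(|G_\ell(i,j)|^2) &= E(\rho^{-v}) \\
&= \int_{-\infty}^{+\infty} \rho^{-v} p_v(v)\, dv \\
&\ge \int_{-\infty}^{-1/n} \rho^{-v} p_v(v)\, dv \\
&\ge \rho^{1/n} \int_{-\infty}^{-1/n} p_v(v)\, dv \\
&= \rho^{1/n} P(v < -\tfrac{1}{n}).
\end{align*}

Taking the limit as $\rho$ tends to infinity on both sides, 

\[ \infty > \lim_{\rho \to \infty} \rho^{1/n} P(v < -\tfrac{1}{n}). \]

This can only imply that $P(v < -\frac{1}{n}) = 0$, since otherwise the RHS will grow to infinity as $\rho$ tends to infinity. 
Hence with probability 1, 

\[ |G_\ell(i,j)|^2 = \rho^{-v} \ \text{ with } v \ge 0. \tag{8} \]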
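By equations (5), (6), (7) and (8), it follows that with probability one, the following equations are true: 

\[ \lambda_i(\Sigma) \le 1 + \sum \rho^{-v} \doteq \rho^0 \ \ \forall i \quad \implies \quad \lambda_{\max} \,\dot{\le}\, \rho^0, \tag{9} \]

where the sum runs over finitely many terms, each with a non-negative exponent $v$. 

We now provide a lower bound for each $\lambda_i(\Sigma)$. Let $e_i$ be the eigenvector corresponding to $\lambda_i(\Sigma)$. Then, 

\begin{align*}
\lambda_i \|e_i\|^2 &= e_i^\dagger \Sigma e_i \\
&= \|e_i\|^2 + e_i^\dagger \Big( \sum_{j=1}^{M} G_j G_j^\dagger \Big) e_i \\
&\ge \|e_i\|^2 \\
\implies \lambda_i &\ge 1 \ \ \forall i \\
\implies \lambda_{\min} &\,\dot{\ge}\, \rho^0. \tag{10}
\end{align*}

By (9) and (10), we have that with probability one: 

\[ \lambda_i \doteq \rho^0 \ \ \forall i. \tag{11} \]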

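To prove the second assertion of the lemma, we use the Amir-Moez bound on the eigenvalues of the product 
of Hermitian, positive-definite matrices [36]. By this bound, for any two positive definite $n \times n$ Hermitian matrices 
$A, B$: 

\[ \lambda_i(A)\lambda_{\min}(B) \le \lambda_i(AB) \le \lambda_i(A)\lambda_{\max}(B). \]

So we get, 

\begin{align*}
\det(I + \rho AB) &= \prod_i (1 + \rho \lambda_i(AB)) \\
&\le \prod_i (1 + \rho \lambda_i(A) \lambda_{\max}(B)) \\
&= \det(I + \rho \lambda_{\max}(B) A).
\end{align*}

Similarly, 

\[ \det(I + \rho AB) \ge \det(I + \rho \lambda_{\min}(B) A). \]

Therefore, 

\[ \det(I + \rho \lambda_{\min}(B) A) \le \det(I + \rho AB) \le \det(I + \rho \lambda_{\max}(B) A). \tag{12} \]

Applying (12) to $A = HH^\dagger$ and $B = \Sigma^{-1}$, we get 

\begin{align}
\det(I + \rho HH^\dagger \lambda_{\min}(\Sigma^{-1})) &\le \det(I + \rho HH^\dagger \Sigma^{-1}) \tag{13} \\
&\le \det(I + \rho HH^\dagger \lambda_{\max}(\Sigma^{-1})). \tag{14}
\end{align}

Since the eigenvalues of $\Sigma$ and of $\Sigma^{-1}$ are reciprocals, it follows that $\lambda_{\max}(\Sigma^{-1}) = 1/\lambda_{\min}(\Sigma) \doteq \rho^0$ and 
$\lambda_{\min}(\Sigma^{-1}) = 1/\lambda_{\max}(\Sigma) \doteq \rho^0$ with probability one. Hence, we have with probability one, 

\[ \det(I + \rho HH^\dagger \Sigma^{-1}) \doteq \det(I + \rho HH^\dagger). \tag{15} \]

This proves the second assertion of the lemma. 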
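Continuing from (13) and (14), we have 

\begin{align}
\Pr\{\log\det(I + \rho HH^\dagger \lambda_{\min}(\Sigma^{-1})) < r\log\rho\} &\ge \Pr\{\log\det(I + \rho HH^\dagger \Sigma^{-1}) < r\log\rho\} \notag \\
&\ge \Pr\{\log\det(I + \rho HH^\dagger \lambda_{\max}(\Sigma^{-1})) < r\log\rho\}. \tag{16}
\end{align}

In the following, we will prove that both the bounds coincide as $\rho \to \infty$. We begin with the bounds on $\lambda_{\min}(\Sigma)$ 
and $\lambda_{\max}(\Sigma)$. By (10), we know that 

\[ \lambda_{\min}(\Sigma) \ge 1 \quad \implies \quad \lambda_{\max}(\Sigma^{-1}) \le 1. \]

Hence, 

\begin{align}
\Pr\{\log\det(I + \rho HH^\dagger \Sigma^{-1}) < r\log\rho\} &\ge \Pr\{\log\det(I + \rho HH^\dagger \lambda_{\max}(\Sigma^{-1})) < r\log\rho\} \notag \\
&\ge \Pr\{\log\det(I + \rho HH^\dagger) < r\log\rho\}. \tag{17}
\end{align}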

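Now bounding $\lambda_{\max}(\Sigma)$, 

\begin{align}
\lambda_{\max}(\Sigma) &= \lambda_{\max}\Big( I + \sum_{i=1}^{M} G_i G_i^\dagger \Big) \notag \\
&= 1 + \lambda_{\max}\Big( \sum_{i=1}^{M} G_i G_i^\dagger \Big) \notag \\
&\le 1 + \sum_{i=1}^{M} \lambda_{\max}(G_i G_i^\dagger) \notag \\
&\le 1 + \sum_{i=1}^{M} \mathrm{Tr}(G_i G_i^\dagger) \notag \\
&= 1 + \sum_{i=1}^{M} \|G_i\|_F^2 \notag \\
&\le 1 + \sum_{i=1}^{M} \prod_{j=1}^{n_i} \|F_{ij}\|_F^2 \tag{18} \\
&\le f(u_1, u_2, \ldots, u_S). \tag{19}
\end{align}

Now, it follows that the RHS of (18) is a multinomial in the random variables $u_1, u_2, \ldots, u_S$, with constant term 1 and 
non-negative integer coefficients. Here, each $u_i$ is the squared norm of a $\mathcal{CN}(0, 1)$ random variable, and therefore 
has an exponential distribution. 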



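\[ f(u_1, u_2, \ldots, u_S) = \sum_{e \in E} c_e u^e, \quad \text{where } e = (e_1, e_2, \ldots, e_S) \in E \subset \mathbb{Z}_+^S, \ |E| < \infty. \]

Clearly, 

\[ f(u_1, u_2, \ldots, u_S) > \rho^\epsilon \implies \exists\, e \ \text{s.t.} \ u^e > \frac{\rho^\epsilon}{T_E\, c_e}, \]

where $T_E$ is the number of terms in the multinomial. Hence 

\[ \Pr\{ f(u_1, u_2, \ldots, u_S) > \rho^\epsilon \} \le \sum_{e} \Pr\Big( u^e > \frac{\rho^\epsilon}{T_E\, c_e} \Big). \tag{20} \]

Now we evaluate a single term in the RHS of (20). Define $T := \max_e T_E\, c_e$. Then 

\begin{align*}
\Pr\Big( u^e > \frac{\rho^\epsilon}{T_E\, c_e} \Big) &\le \Pr\Big( u^e > \frac{\rho^\epsilon}{T} \Big) \\
&\le S \Pr\Big( u_1 > \Big( \frac{\rho^\epsilon}{T} \Big)^{\frac{1}{SG}} \Big) \\
&\doteq \Pr\Big( u_1 > \Big( \frac{\rho^\epsilon}{T} \Big)^{\frac{1}{SG}} \Big) \\
&= \Pr( u_1 > a \rho^{\epsilon'} ) \\
&= \exp(-a \rho^{\epsilon'}) \\
\implies \Pr\Big( u^e > \frac{\rho^\epsilon}{T_E\, c_e} \Big) &\,\dot{\le}\, \exp(-a \rho^{\epsilon'}),
\end{align*}

where $G$ is the maximum degree of $f$ in any variable, $a$ is a constant and $\epsilon' := \epsilon/(SG)$. 
Continuing with (20), 

\begin{align*}
\Pr\{ f(u_1, u_2, \ldots, u_S) > \rho^\epsilon \} &\,\dot{\le}\, \sum_{e} \exp(-a \rho^{\epsilon'}) \\
&= |E| \exp(-a \rho^{\epsilon'}) \\
&\doteq \exp(-a \rho^{\epsilon'}).
\end{align*}

So we have, 

\begin{align*}
\Pr\{ f(u_1, u_2, \ldots, u_S) > \rho^\epsilon \} &\,\dot{\le}\, \exp(-a \rho^{\epsilon'}), \\
\Pr\{ \lambda_{\max}(\Sigma) > \rho^\epsilon \} &\,\dot{\le}\, \exp(-a \rho^{\epsilon'}).
\end{align*}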

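Let $\mathcal{H}$ denote the support of all the fading coefficients in the network, and let $h \in \mathcal{H}$ denote a realization of the 
fading coefficients. Clearly, once an $h$ is given, the values of the matrices $H$, $G_i$ and $F_j$ are all well defined. 

Let $A = \{h \in \mathcal{H} \mid \log\det(I + \rho HH^\dagger \Sigma^{-1}) < r\log\rho\}$ and $B = \{h \in \mathcal{H} \mid \lambda_{\max}(\Sigma) > \rho^\epsilon\}$. Then, 

\begin{align}
\Pr(A) &= \Pr(A \cap B^c) + \Pr(A \cap B) \notag \\
&\le \Pr(A \cap B^c) + \Pr(B) \notag \\
&\,\dot{\le}\, \Pr(A \cap B^c) + \exp(-a \rho^{\epsilon'}). \tag{21}
\end{align}

Now, 

\begin{align}
A &\subseteq \{h \in \mathcal{H} \mid \log\det(I + \rho HH^\dagger \lambda_{\min}(\Sigma^{-1})) < r\log\rho\} \notag \\
&= \{h \in \mathcal{H} \mid \log\det(I + \rho HH^\dagger \lambda_{\max}(\Sigma)^{-1}) < r\log\rho\}, \notag \\
A \cap B^c &\subseteq \{h \in \mathcal{H} \mid \log\det(I + \rho^{1-\epsilon} HH^\dagger) < r\log\rho\}. \tag{22}
\end{align}

Therefore, 

\begin{align*}
\frac{\log \Pr(A)}{\log\rho} &\le \frac{\log\{\Pr(A \cap B^c) + \Pr(B)\}}{\log\rho} \\
&\le \frac{\log\{\Pr(h \in \mathcal{H} \mid \log\det(I + \rho^{1-\epsilon} HH^\dagger) < r\log\rho) + \exp(-a\rho^{\epsilon'})\}}{\log\rho},
\end{align*}

\[ \lim_{\rho \to \infty} \frac{\log \Pr(A)}{\log\rho} \le \lim_{\rho \to \infty} \frac{\log \Pr(h \in \mathcal{H} \mid \log\det(I + \rho^{1-\epsilon} HH^\dagger) < r\log\rho)}{\log\rho}. \tag{23} \]

The last equation follows since the first term in the RHS is polynomial in $\rho$ whereas the second term is exponential, 
and therefore the sum is dominated by the first term. 

After the change of variable $\rho' = \rho^{1-\epsilon}$, and using the symbol $\rho$ itself in place of $\rho'$, 

\[ \lim_{\rho \to \infty} \frac{\log \Pr(A)}{\log\rho} \le (1-\epsilon) \lim_{\rho \to \infty} \frac{\log \Pr(h \in \mathcal{H} \mid \log\det(I + \rho HH^\dagger) < \frac{r}{1-\epsilon}\log\rho)}{\log\rho}. \tag{24} \]

In (24), $\epsilon$ is arbitrary, and we let it tend to zero. Hence, by (24) and (17), the exponents for both the bounds in (16) 
coincide, and hence we get 

\[ \Pr\{\log\det(I + \rho HH^\dagger \Sigma^{-1}) < r\log\rho\} \doteq \Pr\{\log\det(I + \rho HH^\dagger) < r\log\rho\}. \]

This proves the third assertion of the lemma. ■ 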
Lemma 3.2: [3] For any channel of the form $y = Hx + w$, with $w$ being white Gaussian noise, i.i.d. 
Gaussian inputs are sufficient to attain the best possible outage exponent of the channel. 

Proof: The proof is available in [3]. We sketch its outline here for completeness. The outage probability 
is given by 

\begin{align*}
P_{out}(R) &= \inf_{\Sigma_x :\ \mathrm{Tr}(\Sigma_x) \le P} \Pr\{ I(x; y \mid \mathbf{H} = H) < R \} \\
&= \inf_{\Sigma_x :\ \mathrm{Tr}(\Sigma_x) \le P} \Pr\{ \log\det(I + \rho H \Sigma_x H^\dagger) < R \}.
\end{align*}

If $x, y \in \mathbb{C}^m$, then the outage probability can be bounded below and above as 

\[ \Pr\{\log\det(I + \tfrac{\rho}{m} HH^\dagger) < R\} \ge P_{out}(R) \ge \Pr\{\log\det(I + \rho HH^\dagger) < R\}. \]

As $\rho \to \infty$, it can be shown that the bounds are tight, and hence we get (Equation (9) in [3]) 

\[ P_{out}(R) \doteq \Pr(\log\det(I + \rho HH^\dagger) < R). \tag{25} \]



Remark 6: Because of Lemma 3.2, it is sufficient to consider an i.i.d. Gaussian input distribution for characterizing 
the outage exponent. Also, for characterizing the outage exponent, we are allowed to assume that the noise is white in 
the scale of interest (see Lemma 3.1). It can be verified that the noise that we deal with in this paper always satisfies 
the conditions in Lemma 3.1. Hence we will make these two assumptions throughout the paper: 

• Signal is distributed as i.i.d gaussian. 

• Noise is white in the scale of interest. 



D. A DMT Lower Bound 



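Definition 6: Consider a set of $N_i \times N_j$ matrices $A_{ij}$, $i, j = 1, 2, \ldots, N$, $i \ge j$. Let $A$ be a matrix comprised of the 
block matrices $A_{ij}$ in the $(i, j)$th position, i.e., 

\[
A = \begin{bmatrix}
A_{11} & 0 & \cdots & 0 \\
A_{21} & A_{22} & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
A_{N1} & A_{N2} & \cdots & A_{NN}
\end{bmatrix}.
\]

We will call $A$ a block lower-triangular matrix. Define the $\ell$-th sub-diagonal matrix $A_\ell$ of a block lower-
triangular matrix $A$ as the block lower-triangular matrix comprising the entries $A_{\ell 1}, A_{(\ell+1)2}, \ldots, A_{N(N-\ell+1)}$ and 
zeros everywhere else, i.e., 

\[ (A_\ell)_{ij} = A_{ij} \ \text{ if } i - j = \ell - 1, \quad \text{else } (A_\ell)_{ij} = 0_{N_i \times N_j}. \tag{26} \]

The last sub-diagonal matrix of $A$ is defined as the sub-diagonal matrix $A_\ell$ of $A$ with the maximum $\ell$ such that 
$A_\ell$ is a non-zero matrix. 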
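
Definition 6 can be sketched in a few lines of Python for the special case of scalar blocks ($N_i = 1$ for all $i$). The function names `sub_diagonal` and `last_sub_diagonal` and the sample matrix are hypothetical and serve only to illustrate the indexing convention of (26).

```python
def sub_diagonal(A, ell):
    """Return the ell-th sub-diagonal matrix of a lower-triangular matrix A.

    Following (26), entry (i, j) is kept when i - j == ell - 1 and set to
    zero otherwise, so ell = 1 selects the main diagonal.
    """
    n = len(A)
    return [[A[i][j] if i - j == ell - 1 else 0 for j in range(n)]
            for i in range(n)]


def last_sub_diagonal(A):
    """Return (ell, A_ell) for the largest ell with a non-zero A_ell."""
    for ell in range(len(A), 0, -1):
        A_ell = sub_diagonal(A, ell)
        if any(v != 0 for row in A_ell for v in row):
            return ell, A_ell
    return None  # A is the all-zero matrix


A = [[1, 0, 0],
     [2, 3, 0],
     [0, 4, 5]]

diag_part = sub_diagonal(A, 1)       # keeps 1, 3, 5
ell, last_part = last_sub_diagonal(A)  # the third sub-diagonal is zero here
```

For this sample matrix the last sub-diagonal matrix is the second one, because the entry in position (3, 1) is zero.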

Theorem 3.3: Consider a block lower-triangular random matrix $H$ made of matrices $H_{ij}$ of size $N_i \times N_j$. Let 
$M := \sum_{i=1}^{N} N_i$ be the size of the square matrix $H$. Consider a channel of the form $y = Hx + w$, where $H$ is the 
$M \times M$ block lower-triangular random matrix and $x, y, w$ are $M \times 1$ vectors. Let $w$ be a noise vector which is white in 
the scale of interest. Let $x_i, y_i, w_i$ be vectors of length $N_i$ such that $x = [x_1, x_2, \ldots, x_N]^T$, $y = [y_1, y_2, \ldots, y_N]^T$ 
and $w = [w_1, w_2, \ldots, w_N]^T$. 

Let $H_d$ be the block-diagonal part of the matrix $H$ and let $H_\ell$ denote the last sub-diagonal matrix of $H$, as per 
Definition 6. Then 

1) $d_H(r) \ge d_{H_d}(r)$. 

2) $d_H(r) \ge d_{H_\ell}(r)$. 

3) In addition, if the entries of $H_\ell$ are independent of the entries in $H_d$, then $d_H(r) \ge d_{H_d}(r) + d_{H_\ell}(r)$. 
Proof: The channel is given by $y = Hx + w$. Since the noise is white in the scale of interest, by Lemma 3.1 
the DMT of this channel is the same as that of a channel with the noise distributed as $\mathcal{CN}(0, I)$. Therefore, without 
loss of generality, we assume that $w$ is distributed as $\mathcal{CN}(0, I)$. 

We have the block-diagonal part of $H$, $H_d = \mathrm{diag}\{H_{11}, H_{22}, \ldots, H_{NN}\}$, and the last sub-diagonal matrix $H_\ell$ 
contains $N - \ell + 1$ non-zero entries $\{H_{\ell 1}, H_{(\ell+1)2}, \ldots, H_{N(N-\ell+1)}\}$ in the $\ell$-th sub-diagonal. 

The outage probability exponent [3] is given by 

\[ \rho^{-d(r)} \doteq \inf \Pr\{ I(x; y \mid \mathbf{H} = H) < r\log\rho \}. \]

In order to evaluate this exponent, we first evaluate the mutual information. Let us assume that the input $x$ is 
distributed as $\mathcal{CN}(0, I)$. By Lemma 3.2, this input distribution is indeed DMT optimal. We will compute the 
mutual information terms under this assumption that the inputs are i.i.d. Gaussian. 

See Fig. 10 for the i-f diagram. Now, we proceed to find a lower bound on the DMT of the protocol. 

Consider the following series of inequalities for all $i = 1, \ldots, N$: 

\begin{align*}
I(x_i; y \mid \mathbf{H} = H, x_1^{i-1}) &\ge I(x_i; y_i \mid \mathbf{H} = H, x_1^{i-1}) \\
&= I(x_i; H_{ii} x_i + H_{i(i-1)} x_{i-1} + \cdots + H_{i(i-\ell)} x_{i-\ell} + w_i \mid \mathbf{H} = H, x_1^{i-1}) \\
&= I(x_i; H_{ii} x_i + w_i \mid x_1^{i-1}) \\
&= I(x_i; H_{ii} x_i + w_i).
\end{align*}

The last step follows since $\{x_i\}$ are independent. 

\begin{align}
I(x; y \mid \mathbf{H} = H) &= \sum_{i=1}^{N} I(x_i; y \mid \mathbf{H} = H, x_1^{i-1}) \notag \\
&\ge \sum_{i=1}^{N} I(x_i; H_{ii} x_i + w_i) \notag \\
&= I(x; H_d x + w \mid \mathbf{H}_d = H_d). \tag{27}
\end{align}

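In the above, whenever the index of a variable is not positive, we assume that the variable is not present in the 
conditioning, in order to simplify the notation. 
Now by equation (27), 

\begin{align}
\rho^{-d_H(r)} &\doteq \Pr\{ I(x; y \mid \mathbf{H} = H) < r\log\rho \} \tag{28} \\
&\le \Pr\{ I(x; H_d x + w \mid \mathbf{H}_d = H_d) < r\log\rho \} \notag \\
\implies d_H(r) &\ge d_{H_d}(r). \tag{29}
\end{align}

Fig. 10. The i-f diagram for the block lower triangular channel matrix. 

We have another series of inequalities, for all $i = \ell+1, \ldots, N$: 

\begin{align*}
I(x_{i-\ell}; y \mid \mathbf{H} = H, x_{i-\ell+1}^{N}) &\ge I(x_{i-\ell}; y_i \mid \mathbf{H} = H, x_{i-\ell+1}^{N}) \\
&= I(x_{i-\ell}; H_{ii} x_i + H_{i(i-1)} x_{i-1} + \cdots + H_{i(i-\ell)} x_{i-\ell} + w_i \mid \mathbf{H} = H, x_{i-\ell+1}^{N}) \\
&= I(x_{i-\ell}; H_{i(i-\ell)} x_{i-\ell} + w_i \mid x_{i-\ell+1}^{N}) \\
&= I(x_{i-\ell}; H_{i(i-\ell)} x_{i-\ell} + w_i).
\end{align*}

\begin{align}
I(x; y \mid \mathbf{H} = H) &= \sum_{i=1}^{N} I(x_i; y \mid \mathbf{H} = H, x_{i+1}^{N}) \notag \\
&\ge \sum_{i=\ell+1}^{N} I(x_{i-\ell}; y \mid \mathbf{H} = H, x_{i-\ell+1}^{N}) \notag \\
&\ge \sum_{i=\ell+1}^{N} I(x_{i-\ell}; H_{i(i-\ell)} x_{i-\ell} + w_i) \notag \\
&= I(x; H_\ell x + w \mid \mathbf{H}_\ell = H_\ell). \tag{30}
\end{align}

Now by equation (30), 

\begin{align}
\rho^{-d_H(r)} &\doteq \Pr\{ I(x; y \mid \mathbf{H} = H) < r\log\rho \} \tag{31} \\
&\le \Pr\{ I(x; H_\ell x + w \mid \mathbf{H}_\ell = H_\ell) < r\log\rho \} \notag \\
\implies d_H(r) &\ge d_{H_\ell}(r). \tag{32}
\end{align}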

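Therefore, 

\[ I(x; y \mid \mathbf{H} = H) \ge \max\big( I(x; H_d x + w \mid \mathbf{H}_d = H_d),\ I(x; H_\ell x + w \mid \mathbf{H}_\ell = H_\ell) \big). \tag{33} \]

The outage probability exponent [3] is given by 

\[ \rho^{-d(r)} \doteq \inf \Pr\{ I(x; y \mid \mathbf{H} = H) < r\log\rho \}. \]

Now by equation (33), 

\begin{align}
\rho^{-d_H(r)} &\doteq \Pr\{ I(x; y \mid \mathbf{H} = H) < r\log\rho \} \notag \\
&\le \Pr\{ \max( I(x; H_d x + w \mid \mathbf{H}_d = H_d),\ I(x; H_\ell x + w \mid \mathbf{H}_\ell = H_\ell) ) < r\log\rho \} \notag \\
&= \Pr\{ I(x; H_d x + w \mid \mathbf{H}_d = H_d) < r\log\rho,\ I(x; H_\ell x + w \mid \mathbf{H}_\ell = H_\ell) < r\log\rho \} \notag \\
&= \Pr\{ I(x; H_d x + w \mid \mathbf{H}_d = H_d) < r\log\rho \} \times \Pr\{ I(x; H_\ell x + w \mid \mathbf{H}_\ell = H_\ell) < r\log\rho \} \tag{34} \\
&\doteq \rho^{-d_{H_d}(r)} \rho^{-d_{H_\ell}(r)} \notag \\
&= \rho^{-(d_{H_d}(r) + d_{H_\ell}(r))} \notag \\
\implies d_H(r) &\ge d_{H_d}(r) + d_{H_\ell}(r), \tag{35}
\end{align}

where step (34) comes about because of the independence of the entries in $H_d$ and $H_\ell$, which is indeed the 
case because of the assumption that all the fading coefficients in the system are independent. The step that follows it holds 
because i.i.d. complex Gaussian inputs are optimal in the scale of interest. 

■ 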

Corollary 3.4: Theorem 3.3 holds even for the case when the matrix $H$ is block upper-triangular instead of block 
lower-triangular. 

Proof: Follows from the proof of Theorem 3.3, since the DMT of a matrix $H$ and its transpose $H^T$ are the 
same. ■ 
Remark 7: The following two matrix inequalities can be deduced from the proof of Theorem 3.3, with $H_d$ and 
$H_\ell$ defined as in the theorem: 

\[ \det(I + \rho HH^\dagger) \ge \det(I + \rho H_d H_d^\dagger) \quad \text{and} \quad \det(I + \rho HH^\dagger) \ge \det(I + \rho H_\ell H_\ell^\dagger). \]

Remark 8: The DMT of a matrix $H$ is greater than or equal to the DMT of the block diagonal matrix $H_d$. This 
bound will be the one most frequently used whenever we recall Theorem 3.3. 
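
The two determinant inequalities of Remark 7 can be checked numerically for the smallest non-trivial case, a $2 \times 2$ lower-triangular matrix with a repeated diagonal entry $g_1$ and a single sub-diagonal entry $c$. The sketch below is only an illustration under these assumptions; the function name `det_terms`, the sampling of the entries and the value of $\rho$ are hypothetical choices.

```python
import random


def det_terms(g1, c, rho):
    """Return det(I + rho*M*M^H) for M = H, H_d and H_l.

    Here H = [[g1, 0], [c, g1]], H_d = g1*I and H_l has the single
    non-zero entry c below the diagonal.
    """
    a, b = abs(g1) ** 2, abs(c) ** 2
    # H H^H = [[a, g1*conj(c)], [c*conj(g1), a + b]]; expand the 2x2 determinant.
    full = (1 + rho * a) * (1 + rho * (a + b)) - (rho ** 2) * a * b
    diag = (1 + rho * a) ** 2   # det(I + rho * H_d H_d^H)
    sub = 1 + rho * b           # det(I + rho * H_l H_l^H)
    return full, diag, sub


rng = random.Random(1)  # fixed seed for reproducibility
violations = 0
for _ in range(1000):
    g1 = complex(rng.gauss(0, 1), rng.gauss(0, 1))
    c = complex(rng.gauss(0, 1), rng.gauss(0, 1))
    full, diag, sub = det_terms(g1, c, rho=100.0)
    # Small relative slack absorbs floating-point rounding.
    if full < diag * (1 - 1e-9) or full < sub * (1 - 1e-9):
        violations += 1
```

In this case the full determinant simplifies algebraically to $(1 + \rho|g_1|^2)^2 + \rho|c|^2$, which makes both inequalities evident.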

E. Example Applications of the Main Theorem 

In this section, we recover lower bounds on DMT of various existing amplify and forward protocols. While these 
are already known, the derivations presented here are surprisingly simple and they lead to intuitive explanation of 
how these protocols achieve the DMT. 

Example 1: Single Source, Single Sink, Single relay, NAF protocol 

Consider the relay network in Fig. 6, considered in Section III-B. The i-f diagram is given in Fig. 7. The input-output relation is 

\[ y = Hx + n, \tag{36} \]

where 

\[
H = \begin{bmatrix} g_1 & 0 \\ g_2 h_2 & g_1 \end{bmatrix}, \qquad
n = \begin{bmatrix} w_1 \\ w_2 + h_2 v \end{bmatrix}.
\]

Since two time instants are used in order to obtain the equivalent channel matrix, we have a rate loss by a factor 
of 2, and hence $d(r) = d_H(2r)$. It can be checked that the noise vector $n$ satisfies the conditions in Lemma 3.1 and 
therefore is white in the scale of interest. Now it is sufficient to study the DMT of the matrix $H$. Let $H_d = H \odot I$, 
where $\odot$ denotes the Hadamard product (entry-wise product) of matrices. Let $H_\ell$ denote the matrix with only the 
lower triangular entry retained and all other entries set to zero, i.e., 

\[
H_d = \begin{bmatrix} g_1 & 0 \\ 0 & g_1 \end{bmatrix}, \qquad
H_\ell = \begin{bmatrix} 0 & 0 \\ g_2 h_2 & 0 \end{bmatrix}.
\]

The fading coefficients $g_1, g_2, h_2$ are independent and therefore $H_d$ is independent of $H_\ell$. We use Theorem 3.3 
and we get that: 

\[ d_H(r) \ge d_{H_d}(r) + d_{H_\ell}(r). \]

It is easy to evaluate $d_{H_d}(r)$ and $d_{H_\ell}(r)$: 

\begin{align*}
d_{H_d}(r) &= \left(1 - \tfrac{r}{2}\right)^+, \\
d_{H_\ell}(r) &= (1 - r)^+ \\
\implies d_H(r) &\ge \left(1 - \tfrac{r}{2}\right)^+ + (1 - r)^+.
\end{align*}

We can get the DMT of the protocol as 

\begin{align*}
d(r) &= d_H(2r) \\
\implies d(r) &\ge (1 - r)^+ + (1 - 2r)^+.
\end{align*}

From [5] we know that this bound is indeed tight. However, we will not proceed to find an upper bound here. 
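
The bound derived in this example is easy to tabulate. The short sketch below, with hypothetical function names `pos`, `d_H_bound` and `d_naf_bound`, evaluates the matrix bound and then applies the rate scaling $d(r) = d_H(2r)$.

```python
def pos(x):
    """Positive part (x)^+ = max(x, 0)."""
    return max(x, 0.0)


def d_H_bound(r):
    """Lower bound on the DMT of the 2x2 NAF matrix: d_Hd(r) + d_Hl(r)."""
    return pos(1.0 - r / 2.0) + pos(1.0 - r)


def d_naf_bound(r):
    """Lower bound on the protocol DMT, using d(r) = d_H(2r)."""
    return d_H_bound(2.0 * r)


for r in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(r, d_naf_bound(r))
```

The printed values show the two linear pieces of the bound: a diversity of 2 at $r = 0$, a change of slope at $r = 1/2$ where the relayed term vanishes, and zero diversity at $r = 1$.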

Example 2: Single source, Single sink, Multiple relays, SAF 

Consider the network in FigfTJwith N relays. We employ an M-slot amplify-and-forward protocol termed Slotted 
Amplify-and-Forward (SAF) introduced in [17]. Each of the symbols transmitted by the source reaches the sink through 
the direct link and through a relayed path. For the case when the relays are isolated from each other (see [17] for 
a description), the induced channel matrix for an M-slot protocol is an $M \times M$ channel matrix, with $g_d$, 
the fading coefficient of the direct link, along the diagonal and $g_1, \ldots, g_N$, the product coefficients on the relay paths, 
repeating cyclically along the second sub-diagonal. Let $M = kN + 1$ be the slot length, with $k$ a positive integer. 

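For example, for the case $M = 5$, $N = 2$, $k = 2$, the induced channel matrix is given by: 

\[
H := \begin{bmatrix}
g_d & 0 & 0 & 0 & 0 \\
g_1 & g_d & 0 & 0 & 0 \\
0 & g_2 & g_d & 0 & 0 \\
0 & 0 & g_1 & g_d & 0 \\
0 & 0 & 0 & g_2 & g_d
\end{bmatrix}
\]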

See Fig. 11 for the i-f diagram, where the direct link carries $g_d$, $H_i := g_{((i-1) \bmod 2)+1}$ and $\Sigma_i = 1 + |f_i|^2$. Since the channel is used 
for $M$ time slots, we have the relation $d(r) = d_H(Mr)$ between the DMT of the protocol, $d(r)$, and the DMT of 
the matrix, $d_H(r)$. Now, we proceed to find a lower bound on the DMT of the matrix. 

Let $H_d = g_d I$ be the diagonal matrix corresponding to $H$. Let $H_\ell$ be the second sub-diagonal matrix 
corresponding to $H$. It contains $g_1, \ldots, g_N$, each $k$ times, in the second sub-diagonal. From Theorem 3.3, the DMT 
of $H$ can be lower bounded as: 

\begin{align}
d_H(r) &\ge d_{H_d}(r) + d_{H_\ell}(r). \tag{37} \\
\text{We already have } d(r) &= d_H(Mr) \tag{38} \\
\implies d(r) &\ge d_{H_d}(Mr) + d_{H_\ell}(Mr). \tag{39}
\end{align}

Now the DMT of the matrices $H_d$ and $H_\ell$ can be easily derived as $d_{H_d}(r) = \left(1 - \frac{r}{M}\right)^+$ 
and $d_{H_\ell}(r) = N\left(1 - \frac{r}{M-1}\right)^+$. Substituting these in (39) gives 

\[ d(r) \ge (1 - r)^+ + N\left(1 - \frac{M}{M-1}\, r\right)^+. \tag{40} \]

Fig. 11. The i-f diagram for M-slot SAF protocol. 

The right hand side is in fact shown to be equal to the DMT of the SAF protocol in [17]. 
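
The structure of the SAF channel matrix and the bound (40) can be sketched as follows. This is an illustration only: the function names `saf_matrix` and `saf_dmt_bound` are hypothetical, the matrix is built symbolically with string labels rather than fading realizations, and relay isolation is assumed as in the text.

```python
def pos(x):
    """Positive part (x)^+ = max(x, 0)."""
    return max(x, 0.0)


def saf_matrix(N, k):
    """Symbolic M x M SAF channel matrix under relay isolation, M = k*N + 1.

    'gd' sits on the diagonal; 'g1'..'gN' repeat cyclically on the
    sub-diagonal directly below it; every other entry is 0.
    """
    M = k * N + 1
    H = [[0] * M for _ in range(M)]
    for i in range(M):
        H[i][i] = "gd"
        if i >= 1:
            H[i][i - 1] = "g%d" % ((i - 1) % N + 1)
    return H


def saf_dmt_bound(r, N, M):
    """Right-hand side of (40): (1 - r)^+ + N * (1 - M*r/(M - 1))^+."""
    return pos(1.0 - r) + N * pos(1.0 - M * r / (M - 1))


H = saf_matrix(N=2, k=2)  # the M = 5 example from the text
for row in H:
    print(row)
print(saf_dmt_bound(0.0, N=2, M=5))  # N + 1 = 3.0 at r = 0
```

As $M$ grows, the factor $M/(M-1)$ approaches 1 and the bound approaches $(N+1)(1-r)^+$.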

Example 3: Single Source, Single Sink, Multiple Antenna, Single relay, NAF protocol 

Let us first consider a single relay network with the source, the relay and the sink equipped with multiple antennas 
$n_s$, $n_r$, $n_d$ respectively. Let us use the NAF protocol [5] in this scenario, as is done in [15]. The channel matrix turns out to be 

\[ H := \begin{bmatrix} H_d & 0 \\ H_\ell & H_d \end{bmatrix}, \tag{41} \]

where $H_d$ is the $n_d \times n_s$ fading matrix between the source and the sink, and $H_\ell$ is the product fading matrix of an 
$n_r \times n_s$ matrix between the source and the relay and an $n_d \times n_r$ matrix between the relay and the sink. Proceeding in the 
same manner as in Example 1, we get that $d(r) \ge d_{H_d}(r) + d_{H_\ell}(2r)$, where $d_{H_d}(r)$ is the DMT of the direct 
link matrix $H_d$, and $d_{H_\ell}(r)$ is the DMT of the product matrix $H_\ell$. This lower bound was derived as Theorem 1 of 
[15]. 

Let us now consider a generalized NAF protocol (see [23]) where, for the first $T$ time instants, the source 
transmits to the relays, and then the relays transmit a linear transformation of the received vector over the $T$ time 
instants. Even in this case, the input-output relation can be represented using an equation of the form (41). 
However, $H$ is now a $2Tn_d \times 2Tn_s$ matrix, $H_d$ is a $Tn_d \times Tn_s$ block diagonal matrix with the direct link fading 
matrix repeated $T$ times, and $H_\ell$ is any $Tn_d \times Tn_s$ matrix (which depends on the linear transformations used at 
the relays) relating the inputs to the output at the sink due to the relaying path. Let $d_C(r) := d_{H_\ell}(Tr)$ denote the 
DMT of the same scheme used without the direct link and with full-duplex relays. Let $d_D(r) := d_{H_d}(Tr)$ denote 
the DMT of the direct path fading matrix. 

Then Theorem 3.3 can be used to get the following inequality for the DMT of this generalized NAF scheme: 

\[ d(r) \ge d_D(r) + d_C(2r). \]

This proves Conjecture 1 of [23]. 

Example 4: Single Source, Single Sink, Multiple Antenna, Multiple relays, NAF protocol 

In [15], the authors consider a two-hop relay network with a direct link and N relays. Consider the NAF protocol 
for the N-relay case suggested in [15], in which each path is used for an equal duration. Here we consider a more general 
version of the NAF protocol, where different relaying paths are activated for different fractions of time. Let the 
relaying path through relay $i$ be used for a fraction $f_i$ of the time. For this protocol, let us derive the DMT. The 
matrix connecting the input and the output is a block lower-triangular matrix with the direct-link fading matrix $H_d$ 
repeated on the block-diagonal. The second sub-diagonal contains the matrices $R_1, R_2, \ldots, R_N$, where $R_i$ is 
the product matrix along the $i$th relay. We can bound the DMT of the resulting matrix using Theorem 3.3: 

\[ d(r) \ge d_{H_d}(r) + d_C(2r), \tag{42} \]

where $d_C(r)$ is the DMT of a parallel channel with entries $R_i$ occurring for a fraction $f_i$ of the time. We can 
evaluate $d_C(r)$ explicitly from the DMT $d_i(r)$ of the product channel $R_i$. 

The DMT of this channel can be computed using the parallel channel formula given in Equation (55) 
in Lemma 3.8, and it is given by 

\[ d_C(r) = \sup_{(f_1, f_2, \ldots, f_N)} \ \inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} f_i r_i = r} \ \sum_{i=1}^{N} d_i(r_i), \tag{43} \]

where $d_i(r)$ is the DMT of the product channel in the $i$th channel and corresponds to the DMT of the product 
matrix $G_i H_i$. 

Therefore the overall DMT is given by 

\[ d(r) \ge d_{H_d}(r) + \sup_{(f_1, f_2, \ldots, f_N)} \ \inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} f_i r_i = 2r} \ \sum_{i=1}^{N} d_i(r_i). \tag{44} \]

As a particular choice, if $f_i = 1/N$ for all $i$, then 

\[ d_C(r) = \inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} r_i = Nr} \ \sum_{i=1}^{N} d_i(r_i). \tag{45} \]

Let $\theta_i := \frac{r_i}{Nr}$. Then we have 

\[ d_C(r) = \inf_{(\theta_1, \theta_2, \ldots, \theta_N):\ \sum_{i=1}^{N} \theta_i = 1} \ \sum_{i=1}^{N} d_i(N \theta_i r). \tag{46} \]

We plug this equation into (42) and get 

\[ d(r) \ge d_{H_d}(r) + \inf_{(\theta_1, \theta_2, \ldots, \theta_N):\ \sum_{i=1}^{N} \theta_i = 1} \ \sum_{i=1}^{N} d_i(2N \theta_i r), \tag{47} \]

which is indeed the formula in Theorem 2 of [15]. However, the lower bound on the DMT that we have in Equation (44) 
is better than the lower bound in Theorem 2 of [15], since we allow for arbitrary periods of activation, which is a 
more general approach. 

Remark 9: In the notation of [15], $d_{H_d}(r) = d_F(r)$, since $F$ is the matrix of transformation between source and 
sink through the direct link. Also, $G_i$ is the matrix from the source to relay $i$ and $H_i$ is the matrix from relay $i$ to the sink. 
According to the notation of [15], $d_{G_i H_i}(r)$ is the DMT corresponding to the product matrix $G_i H_i$. 

F. DMT of elementary network connections 
1 ) Parallel Network : 

Lemma 3.5: Consider a parallel channel with $M$ links, each link being represented by $y_i = H_i x_i + w_i$, and 
let the optimal DMT of the $i$th link be $d_i(r)$. Then the optimal DMT of the parallel channel is given by 

\[ d(r) = \inf_{(r_1, r_2, \ldots, r_M):\ \sum_{i=1}^{M} r_i = r} \ \sum_{i=1}^{M} d_i(r_i). \tag{48} \]

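Proof: The input-output relation of the parallel channel is given by 

\[
\begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_M \end{bmatrix}
=
\begin{bmatrix} H_1 & & & \\ & H_2 & & \\ & & \ddots & \\ & & & H_M \end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_M \end{bmatrix}
+ n. \tag{49}
\]

Fig. 12. The Parallel channel with M sub-channels 

\begin{align}
I(x; y \mid \mathbf{H} = H) &= h(y \mid \mathbf{H} = H) - \sum_{i=1}^{M} h(y_i \mid y_1^{i-1}, x, \mathbf{H} = H) \notag \\
&= h(y \mid \mathbf{H} = H) - \sum_{i=1}^{M} h(y_i \mid x_i, \mathbf{H} = H) \notag \\
&\le \sum_{i=1}^{M} h(y_i \mid \mathbf{H} = H) - \sum_{i=1}^{M} h(y_i \mid x_i, \mathbf{H} = H) \notag \\
&= \sum_{i=1}^{M} \left[ h(y_i \mid \mathbf{H} = H) - h(y_i \mid x_i, \mathbf{H} = H) \right] \notag \\
&= \sum_{i=1}^{M} I(x_i; y_i \mid \mathbf{H} = H) \notag \\
&= \sum_{i=1}^{M} I(x_i; y_i \mid \mathbf{H}_i = H_i) \tag{50}
\end{align}

\[ \implies \Pr\{ I(x; y \mid \mathbf{H} = H) < r\log\rho \} \ge \Pr\Big\{ \sum_{i=1}^{M} I(x_i; y_i \mid \mathbf{H}_i = H_i) < r\log\rho \Big\}. \]

Equality in the last equation occurs if all the $x_i$ are independent. So we will choose the $x_i$ to be independent for 
the rest of the discussion, since this maximizes the mutual information and hence minimizes the error probability. 
Define $Z_i := I(x_i; y_i \mid \mathbf{H}_i = H_i)$. Now $Z_i$ is a random variable which depends on the realization of the channel. 
Since $\{H_i\}$ are independent, $\{Z_i\}$ are also independent. Let $R_i = r_i \log(\rho)$ for $i = 1, 2$ and $R = r\log(\rho)$. 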

Now our goal is to evaluate P{ Y^iLi Zi < rlog(p) }. To do this, first we consider the case when M = 2 and 
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we evaluate P{ Zi + Z 2 < rlog(p) }. Then we extend this to general M by induction. 

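$$F_{Z_i}(R_i) := P\{ Z_i < R_i \}, \qquad f_{Z_i}(R_i) := \frac{d}{dR_i} F_{Z_i}(R_i)$$

Let $F_{Z_i}(R_i) \doteq \rho^{-d_i(r_i)}$, so that $f_{Z_i}(R_i) \doteq \rho^{-d_i(r_i)}$ in the scale of interest. Writing $P(Z_1 + Z_2 < R) \doteq \rho^{-d(r)}$, we have 

$$\begin{aligned}
P(Z_1 + Z_2 < R) &= \int_0^{\infty} f_{Z_1}(R_1)\, F_{Z_2}(R - R_1)\, dR_1 \\
&\doteq \int_0^{\infty} \rho^{-d_1(r_1)}\, \rho^{-d_2(r - r_1)}\, \ln(\rho)\, dr_1
\end{aligned}$$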

By Varadhan's Lemma [30], the SNR exponent of the integral can be evaluated in the scale of interest as: 

$$d(r) = \inf_{r_1 \ge 0} \left\{ d_1(r_1) + d_2(r - r_1) \right\} = \inf_{(r_1, r_2):\ r_1 + r_2 = r}\ \sum_{i=1}^{2} d_i(r_i)$$

Now, consider the general case with $M$ parallel channels: 

$$P\Big\{ \sum_{i=1}^{M} Z_i < r \log(\rho) \Big\} \doteq \rho^{-d(r)}$$

Proceeding by induction, we get: 

$$d(r) = \inf_{(r_1, r_2, \ldots, r_M):\ \sum_{i=1}^{M} r_i = r}\ \sum_{i=1}^{M} d_i(r_i)$$

■ 

Remark 10: The following upper and lower bounds on the outage exponent are immediate from Equation (48): 

$$d(r) \le \sum_{i=1}^{M} d_i\!\left(\frac{r}{M}\right) \qquad (51)$$

$$d(r) \ge \sum_{i=1}^{M} d_i(r) \qquad (52)$$

We recall the following lemma from the theory of majorization [32]: 

Lemma 3.6: [32] If $f(\cdot)$ is a symmetric function in the variables $r_1, r_2, \ldots, r_N$ and is convex in each of the variables 
$r_i$, $i = 1, 2, \ldots, N$, then 

$$\inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} r_i = r} f(r_1, r_2, \ldots, r_N) = f\!\left(\frac{r}{N}, \frac{r}{N}, \ldots, \frac{r}{N}\right) \qquad (53)$$

Lemma 3.7: The DMT of a parallel channel with all the individual channels being identical and having a convex 
DMT is given by: 

$$d(r) = M\, d_1\!\left(\frac{r}{M}\right) \qquad (54)$$

Proof: Consider $\sum_{i=1}^{M} d_i(r_i)$ as a function of the variables $r_i$. This function satisfies the conditions of 
Lemma 3.6. Therefore, 

$$\begin{aligned}
d(r) &= \inf_{(r_1, r_2, \ldots, r_M):\ \sum_{i=1}^{M} r_i = r}\ \sum_{i=1}^{M} d_i(r_i) \\
&= \sum_{i=1}^{M} d_i\!\left(\frac{r}{M}\right) = M\, d_1\!\left(\frac{r}{M}\right)
\end{aligned}$$

■ 
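As an informal numerical check of Lemma 3.5, Remark 10 and Lemma 3.7, the infimum in (48) can be approximated on a grid by repeating the two-link step of the proof one link at a time. The sketch below is only illustrative: the function names, the grid resolution and the two example DMT curves (a single-antenna Rayleigh link and the standard 2x2 Rayleigh MIMO curve) are assumptions chosen for the example, not part of the paper.

```python
def parallel_dmt(dmts, r, steps=400):
    """Grid approximation of inf over (r_1..r_M), sum r_i = r, of sum_i d_i(r_i).

    Links are folded in one at a time, mirroring the induction in the proof
    of Lemma 3.5 (the M = 2 step applied repeatedly).
    """
    delta = r / steps
    # best[j]: smallest exponent sum of the links folded in so far,
    # when they share a total multiplexing gain of j * delta
    best = [dmts[0](j * delta) for j in range(steps + 1)]
    for d in dmts[1:]:
        vals = [d(j * delta) for j in range(steps + 1)]
        best = [min(best[j - i] + vals[i] for i in range(j + 1))
                for j in range(steps + 1)]
    return best[steps]


def siso_dmt(r):
    # Single-antenna Rayleigh link: d(r) = (1 - r)^+  (illustrative choice)
    return max(1.0 - r, 0.0)


def mimo_2x2_dmt(r):
    # Convex piecewise-linear curve through (0, 4), (1, 1), (2, 0)
    if r <= 1.0:
        return 4.0 - 3.0 * r
    return max(2.0 - r, 0.0)


if __name__ == "__main__":
    # Lemma 3.7: identical convex links give M * d_1(r / M)
    print(parallel_dmt([mimo_2x2_dmt] * 2, 2.0), 2 * mimo_2x2_dmt(1.0))
    # Remark 10: sum_i d_i(r) <= d(r) <= sum_i d_i(r / M)
    mixed = [siso_dmt, mimo_2x2_dmt]
    print(sum(d(1.0) for d in mixed),
          parallel_dmt(mixed, 1.0),
          sum(d(0.5) for d in mixed))
```

The grid search is quadratic in `steps` per link, which is adequate for plotting a DMT curve but is not meant as an efficient solver.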

2) Parallel Channel with Repeated Coefficients: 

Lemma 3.8: Consider a parallel channel with $M$ links with repeated channel matrices. Let there be $N$ distinct 
channel matrices $H^{(1)}, H^{(2)}, \ldots, H^{(N)}$, with $H^{(i)}$ repeating in $n_i$ sub-channels, such that $\sum_{i=1}^{N} n_i = M$. Let $d_i(r)$ denote the DMT of the channel $H^{(i)}$. 
Then the DMT of the parallel channel is given by 

$$d(r) = \inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} n_i r_i = r}\ \sum_{i=1}^{N} d_i(r_i) \qquad (55)$$

Fig. 13. The Parallel Network with repeated coefficients 

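Proof: Following the same line of argument as in the proof of Lemma 3.5, choose the $x_i$ to be independent. For 
computing the DMT, we know from Lemma 3.2 that the inputs can in fact be independent and identically distributed 
with a $\mathcal{CN}(0, I)$ distribution. So we have 

$$I(x; y \mid \mathbf{H} = H) = \sum_{i=1}^{M} I(x_i; y_i \mid \mathbf{H}_i = H_i)$$

$$\begin{aligned}
P\{ I(x; y \mid \mathbf{H} = H) < r \log \rho \} &= P\Big\{ \sum_{i=1}^{M} I(x_i; y_i \mid \mathbf{H}_i = H_i) < r \log \rho \Big\} \\
&= P\Big\{ \sum_{i=1}^{N} n_i\, I(x_i; y_i \mid \mathbf{H}_i = H_i) < r \log \rho \Big\}
\end{aligned}$$

In the last sum the index $i$ runs over the $N$ distinct channel matrices: sub-channels that share a channel matrix and carry i.i.d. inputs contribute equal mutual information. Now, define $Z'_i := n_i\, I(x_i; y_i \mid \mathbf{H}_i = H_i)$. Also let 

$$\begin{aligned}
\rho^{-d'_i(r)} &\doteq P\{ Z'_i < r \log(\rho) \} \\
&= P\Big\{ I(x_i; y_i \mid \mathbf{H}_i = H_i) < \Big(\frac{r}{n_i}\Big) \log(\rho) \Big\} \\
&\doteq \rho^{-d_i(r / n_i)}
\end{aligned}$$

where $\rho^{-d_i(r)} \doteq P\{ I(x_i; y_i \mid \mathbf{H}_i = H^{(i)}) < r \log(\rho) \}$. 

Using the same convolution argument as in the proof of Lemma 3.5, 

$$\begin{aligned}
d(r) &= \inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} r_i = r}\ \sum_{i=1}^{N} d'_i(r_i) \\
&= \inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} r_i = r}\ \sum_{i=1}^{N} d_i\!\left(\frac{r_i}{n_i}\right) \\
&= \inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} n_i r_i = r}\ \sum_{i=1}^{N} d_i(r_i)
\end{aligned}$$

■ 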

G. Achievability of outage exponent 

In all the above derivations, it was assumed that the outage exponent was equal to the DMT. It needs to be 
shown that the outage exponent can indeed be achieved. We first give a simple compound channel argument for the 
achievability, similar to the argument in [11]. Consider a compound channel, where a channel $s$ is chosen from a 
set of possible channels $\mathcal{S}$ and then remains fixed. The capacity of the compound channel is given by 

$$C = \sup_{p_X(x)}\ \inf_{s \in \mathcal{S}} I(X; Y \mid S = s) \qquad (56)$$

If the maximizing input distribution $p^*_X(x)$ is the same for all possible channels $s \in \mathcal{S}$, then 

$$C = \inf_{s \in \mathcal{S}} C_s, \quad \text{where} \quad C_s := I(X; Y \mid S = s)$$

evaluated for $p^*_X(x)$, which is indeed the capacity of the channel $s$. 

Consider the set of all channels not in outage, $\mathcal{H}$. Then $\mathcal{H}$ is defined as 

$$\mathcal{H} = \{ H : I(X; Y \mid \mathbf{H} = H) > r \log(\rho) \} \qquad (57)$$

If the optimizing distribution is independent of $H$ in $\mathcal{H}$, then the capacity of the compound channel $\mathcal{H}$ is given 
by $C = r \log \rho$. 

This means that there exists a code for this compound channel whose probability of error is less than $\epsilon$ for any 
given $\epsilon > 0$. The probability of error of this code when used on the slow fading channel is given by 

$$\begin{aligned}
P_e &= P_{out}\, P_{e \mid out} + P_{out^c}\, P_{e \mid out^c} && (58) \\
&\le P_{out} + P_{e \mid out^c} && (59) \\
&\le P_{out} + \epsilon && (60) \\
&\doteq P_{out} && (61)
\end{aligned}$$



where $P_{out}$ is the probability of the channel being in outage and $P_{out^c}$ is the probability of the channel not being 
in outage. $P_{e \mid out}$ is the probability of error of the code given that the channel is in outage, and $P_{e \mid out^c}$ is the probability 
of error of the code given that the channel is not in outage. Thus the outage probability is achievable if the optimizing 
distribution is independent of $H$. 

Since the distribution optimizing the outage exponent is i.i.d. Gaussian, which is independent of $H$, as shown in 
Lemma 3.2, we can show that the outage exponent is achievable using universal codes. It should be pointed out here 
that short approximately universal codes for the MIMO parallel channel were given recently in [13]. These codes 
indeed achieve the outage exponent of the parallel channels considered in Section 3.5. 
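To make the chain (58)-(61) concrete, the following sketch evaluates the outage probability of a single Rayleigh-faded link in closed form and compares the exponent of $P_{out}$ with that of the bound $P_{out} + \epsilon$. The single-link setting, the function names and the choice $\epsilon = P_{out}$ are assumptions made purely for illustration; the argument above applies to general channel matrices.

```python
import math


def rayleigh_outage_probability(rho, r):
    """Pr{ log(1 + rho*|h|^2) < r*log(rho) } when |h|^2 is Exp(1) distributed."""
    threshold = (rho ** r - 1.0) / rho
    # 1 - exp(-threshold), computed stably for small thresholds
    return -math.expm1(-threshold)


def snr_exponent(probability, rho):
    """Finite-SNR estimate of the exponent d in probability ~ rho^(-d)."""
    return -math.log(probability) / math.log(rho)


def error_exponent_bound(rho, r, eps):
    """Exponent of the upper bound P_out + eps appearing in (60)."""
    return snr_exponent(rayleigh_outage_probability(rho, r) + eps, rho)


if __name__ == "__main__":
    rho, r = 1e8, 0.5
    p_out = rayleigh_outage_probability(rho, r)
    print(snr_exponent(p_out, rho))                   # close to 1 - r
    print(error_exponent_bound(rho, r, eps=p_out))    # slightly smaller
```

With $\epsilon$ of the same order as $P_{out}$, the two exponents differ by $\log 2 / \log \rho$, which vanishes as $\rho$ grows; this is the sense in which the last step of the chain holds.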

IV. Full Duplex Relay Networks 

In this section, we consider networks equipped with full duplex (FD) relay nodes. First, we draw a general result 
on the optimum diversity of a multi-terminal network. We also provide an achievable DMT region for an ss-ss 
network with single antenna nodes. 

A. Mincut equals Diversity 

Theorem 4.1: Consider a multi-terminal fading network with nodes having multiple antennas with each edge 
having iid Rayleigh-fading coefficients. The maximum diversity achievable for any flow is equal to the min- 
cut between the source and the sink corresponding to the flow. Each flow can achieve its maximum diversity 
simultaneously. 

Proof: First we consider the case where there is only a single source-sink pair. We will prove the theorem 
in two cases: the single antenna case and the multiple antenna case. We shall assume that all the fade 
coefficients are independent. 

Case I: Network with single antenna nodes 

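Let the source be $S_i$ and the sink be $D_j$. Let $\mathcal{C}_{ij}$ denote the set of all cuts between $S_i$ and $D_j$. 
From the cutset bound [8], 

$$\begin{aligned}
d(r) &\le \min_{\mathcal{C} \in \mathcal{C}_{ij}} d_{\mathcal{C}}(r) \\
\Rightarrow d(0) &\le \min_{\mathcal{C} \in \mathcal{C}_{ij}} d_{\mathcal{C}}(0) =: m
\end{aligned}$$

where $m$ is the number of edges in the min-cut between $S_i$ and $D_j$. 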

It is sufficient to prove that a diversity order of $m$ is achievable. We know that the number of edges in the min-cut 
is the maximum number of edge disjoint paths between the source and the sink. Schedule the network in such a 
way that each edge in a given edge disjoint path is activated one by one. The same is repeated for all the edge 
disjoint paths. Thus, the same data symbol is transmitted through all the edge disjoint paths from $S_i$ to $D_j$. 
Let the number of edges in the $i$-th edge disjoint path be $n_i$. The $j$-th edge in the $i$-th edge disjoint path 
is denoted by $e_{ij}$ and the associated fading coefficient by $h_{ij}$. So the activation schedule will be as follows: 

$e_{11}, e_{12}, \ldots, e_{1(n_1)}, e_{21}, \ldots, e_{2(n_2)}, \ldots, e_{m1}, e_{m2}, \ldots, e_{m(n_m)}$. Now define $h_i := \prod_{j=1}^{n_i} h_{ij}$. Let the total number 
of time slots required be $N = \sum_{i=1}^{m} n_i$. 

With this protocol in place, the equivalent channel seen by a symbol is 

$$H = \begin{bmatrix} h_1 & & & \\ & h_2 & & \\ & & \ddots & \\ & & & h_m \end{bmatrix}$$

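If $d_e(r)$ is the outage exponent for this channel, 

$$\begin{aligned}
\rho^{-d_e(r)} &\doteq \Pr\Big\{ \sum_{i=1}^{m} \log(1 + \rho |h_i|^2) < r \log \rho \Big\} \\
&= \Pr\Big\{ \sum_{i=1}^{m} \log\Big(1 + \rho \prod_{j=1}^{n_i} |h_{ij}|^2\Big) < r \log \rho \Big\} \\
&= \Pr\Big\{ \sum_{i=1}^{m} \log\Big(1 + \rho^{\,1 - \sum_{j=1}^{n_i} u_{ij}}\Big) < \log \rho^{\,r} \Big\}, \quad \text{where } |h_{ij}|^2 = \rho^{-u_{ij}} \\
&\doteq \Pr\Big\{ \prod_{i=1}^{m} \rho^{\,(1 - \sum_{j=1}^{n_i} u_{ij})^+} < \rho^{\,r} \Big\}
\end{aligned}$$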

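Following the same line of argument as in [3], 

$$d_e(r) = \inf_{\mathcal{A}}\ \sum_{i=1}^{m} \sum_{j=1}^{n_i} u_{ij} \qquad (62)$$

where $\mathcal{A} = \{ u_{ij} : \sum_{i=1}^{m} (1 - \sum_{j=1}^{n_i} u_{ij})^+ < r \}$. Let $\sum_{j=1}^{n_i} u_{ij} = u_i$. Then, 

$$d_e(r) = \inf_{\mathcal{A}'}\ \sum_{i=1}^{m} u_i \qquad (63)$$

where $\mathcal{A}' = \{ u_i : \sum_{i=1}^{m} (1 - u_i)^+ < r \}$ 

$$\Rightarrow d_e(r) = m - r$$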
Since we use $N$ channel uses, the effective outage exponent is given by 

$$d(r) = d_e(Nr) = m - Nr \qquad (64)$$

Hence the maximum achievable diversity is $m$. 
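The quantity $m$ used in Case I is the number of edge disjoint source-sink paths, which equals the min-cut when every edge has unit capacity. The sketch below computes it with a plain augmenting-path max-flow on a small directed graph and evaluates the scheduled exponent $m - Nr$ from (64). The graph, the node labels, the function names and the treatment of links as directed edges are all assumptions made for the example.

```python
from collections import deque


def max_edge_disjoint_paths(edges, source, sink):
    """Unit-capacity max-flow (BFS augmenting paths).

    The returned value is the number of edge-disjoint source-sink paths,
    which equals the number of edges m in the min-cut.
    """
    capacity = {}
    neighbours = {}
    for u, v in edges:
        capacity[(u, v)] = capacity.get((u, v), 0) + 1
        capacity.setdefault((v, u), 0)          # residual (reverse) arc
        neighbours.setdefault(u, set()).add(v)
        neighbours.setdefault(v, set()).add(u)
    flow = 0
    while True:
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v in neighbours.get(u, ()):
                if v not in parent and capacity[(u, v)] > 0:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return flow
        v = sink
        while parent[v] is not None:            # push one unit along the path
            u = parent[v]
            capacity[(u, v)] -= 1
            capacity[(v, u)] += 1
            v = u
        flow += 1


def scheduled_dmt(m, num_slots, r):
    """Exponent m - N*r of the one-edge-at-a-time schedule, clipped at zero."""
    return max(m - num_slots * r, 0.0)


if __name__ == "__main__":
    # Hypothetical network: source S, relays A and B, sink D
    links = [("S", "A"), ("S", "B"), ("A", "D"), ("B", "D"), ("A", "B")]
    m = max_edge_disjoint_paths(links, "S", "D")
    print(m, scheduled_dmt(m, num_slots=4, r=0.0))
```

The schedule pays a factor of $N$ in multiplexing gain, which is why it only establishes the $r = 0$ point, i.e. the maximum diversity.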
Case II: Network with multiple antenna nodes 

In the multiple antenna case, we regard any link between a node with $n_t$ transmit antennas and a node with $n_r$ receive antennas as being composed 
of $n_t n_r$ links, with one link between each transmit and each receive antenna. Note that it is possible to selectively 
activate precisely one of the $n_t n_r$ Tx-antenna-Rx-antenna pairs by transmitting from just one antenna 
and listening at just one Rx antenna. The same strategy as in the single antenna case can then be applied to achieve 
this diversity in the network. 

Fig. 14 illustrates this conversion for the case of a single source $S$, two relays $R_1$ and $R_2$, and a sink $D$. Having 
converted the multiple antenna network into one with single antenna nodes, Case II follows from Case I. 

Thus the proof is complete for the single flow from Si to Dj. 

When there are multiple flows in the network, we simply schedule the data of all the flows in a time-division 
manner. This will entail a rate loss - however, since we are interested only in the diversity, we can still achieve 
each flow's maximum diversity simultaneously. ■ 

Definition 7: Consider a network N and a path P from source to sink. This path P is said to have an intermediate 
direct path if there is a direct link in N connecting two non-consecutive nodes in P. 

Theorem 4.2: Consider an ss-ss full-duplex network with single antenna nodes. Let the min-cut of the network 
be $M = d_{\max}$. Let the network satisfy either of the two conditions: 

1) None of the $M$ edge disjoint paths between source and sink have intermediate direct paths, or 

2) The directed graph representing the network has no directed cycles. 

Then, a linear DMT $d(r) = M(1 - r)^+$ between the maximum multiplexing gain of 1 and the maximum diversity is 
achievable. 

Fig. 14. Illustration: $n_S = n_D = 2$, $n_1 = n_2 = 3$. (a) Original network with multiple antenna nodes. (b) Equivalent network with single antenna nodes. 

Proof: Given that the network has min-cut $M$, there are $M$ edge disjoint paths from source to 
sink. Under condition 1), these edge disjoint paths do not have any intermediate direct 
paths. Let us call the edge disjoint paths $e_1, e_2, \ldots, e_M$. Let the product of the fading coefficients along the path $e_i$ 
be $g_i$. Let $D_i$ be the delay of path $e_i$, and let $D = \max_i D_i$. Add a delay of $D - D_i$ to the path $e_i$ so that all 
paths have equal delay. We follow the steps below to activate the edges: 

1) a) Activate edge disjoint path $e_1$ for a period $T$, where $T > D$, activating all edges of the edge disjoint 
path simultaneously. This creates a transfer matrix from the source symbols to the sink symbols that is a 
diagonal matrix with zeros on the first $D$ rows and a single non-zero thread made up 
of coefficients equal to $g_1$, the product coefficient on path $e_1$. After this is done, the various 
nodes in the network store the data that have not yet been passed to the sink for future use. 
b) Repeat Step 1.a for all edge disjoint paths $e_1, \ldots, e_M$. The net transfer matrix will comprise $MD$ zero 
rows and one non-zero thread which contains each $g_i$ for $T - D$ durations. 

2) Activate all the edge disjoint paths, each for time $T$. This time, the net transfer matrix will comprise a 
single non-zero thread which contains each product coefficient $g_i$ for $T$ durations. There will be no zero rows 
since all nodes always have information to transmit. 

3) Repeat Step 2 a further $L - 2$ times, so that all edge disjoint paths have been activated $L$ times. 

The induced channel matrix from source to sink will then begin with $MD$ zero rows; on removing these 
rows we get a transfer matrix $H$. The DMT of the protocol is then $d(r) = d_H(LMTr)$, which holds for $L$ large. 

This matrix $H$ will have each $g_i$ appearing $LT - D$ times along the diagonal. It will be lower triangular if 
none of the $M$ edge disjoint paths between source and sink have intermediate direct paths, and 
upper triangular if the directed graph representing the network has no directed cycles. In either case, using 
Theorem 3.3 and Corollary 3.4, we get $d_H(r) \ge d_{H_d}(r)$, where $H_d$ is the diagonal matrix corresponding to the 
matrix $H$. But $H_d$ contains $LT - D$ entries of each $g_i$; therefore its DMT is given by $d_{H_d}(r) = d_{H_1}\!\left(\frac{r}{LT - D}\right)$, 
where $H_1 = \mathrm{diag}(g_1, \ldots, g_M)$. Hence 

$$d(r) = d_H(LMTr) \ \ge\ d_{H_d}(LMTr) = d_{H_1}\!\left(\frac{LMT}{LT - D}\, r\right).$$

For $LT$ tending to $\infty$, we get $d(r) \ge d_{H_1}(Mr)$. Now $d_{H_1}(r) = (M - r)^+$. Since $M = d_{\max}$, we get 

$$\Rightarrow d(r) \ge d_{\max}(1 - r)^+ \qquad (65)$$

■ 
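The role of the limit $LT \to \infty$ in the proof can be seen numerically. The sketch below assumes that the finite-block bound has the form $d_{H_1}\!\left(\frac{LMT}{LT - D} r\right)$ with $d_{H_1}(r) = (M - r)^+$, and compares it with the limiting value $M(1 - r)^+$; the function names and the sample parameter values are hypothetical.

```python
def finite_block_dmt_bound(M, L, T, D, r):
    """(M - L*M*T*r / (L*T - D))^+ : bound before letting L*T grow.

    Assumes each product coefficient g_i occupies L*T - D diagonal entries
    out of L*M*T channel uses, as in the proof of Theorem 4.2.
    """
    if L * T <= D:
        raise ValueError("the protocol needs L*T > D")
    return max(M - (L * M * T * r) / (L * T - D), 0.0)


def limiting_dmt(M, r):
    """Linear tradeoff M(1 - r)^+ reached as L*T tends to infinity."""
    return max(M * (1.0 - r), 0.0)


if __name__ == "__main__":
    M, T, D, r = 3, 10, 4, 0.5
    for L in (1, 10, 100, 1000):
        print(L, finite_block_dmt_bound(M, L, T, D, r), limiting_dmt(M, r))
```

The gap to the linear tradeoff shrinks like $D / (LT)$, which is the rate loss caused by the initial delay of $D$ time instants on each path.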

Corollary 4.3: For the full duplex KPP networks without direct link (i.e. KPP(I) networks) and full duplex 
layered networks, a DMT of $M(1 - r)^+$, which is a linear DMT between the maximum diversity and the maximum 
multiplexing gain, can be achieved. 

Proof: It can be easily shown that the $M$ edge disjoint paths between source and sink for KPP(I) and layered 
networks do not have any intermediate direct path. Therefore these networks satisfy condition (1) of Theorem 4.2, and the result 
follows. ■ 

V. Half duplex networks with isolated paths - KPP Networks 

In this section, we consider single-source single-sink (ss-ss) half duplex networks in which the relaying paths are 
isolated (i.e., interference between the paths is absent). Every node is equipped with a single antenna. In general, it 
is assumed that half duplex networks incur a loss in multiplexing gain by a factor of 2. But we will establish that, 
in most cases, we can achieve the same DMT performance with half duplex relays as with full duplex ones. 
We will show systematic ways of constructing protocols for multi-hop networks with half duplex relays. We 
will show that the optimal DMT of KPP networks, with or without a direct link, can be achieved. 

We first consider KPP networks in the absence of a direct link. At the end of this section we extend the results 
to KPP(D) networks. 

A. Protocols for K-Parallel Path Networks 

We consider amplify-and-forward (AF) protocols in this paper. In the class of AF protocols considered in this 
paper, the communication takes place in a block of $N$ time instants, during which the channel fading coefficients 
remain fixed. We assume that the edge activations are periodic, and we refer to $N$ as the cycle length of the protocol. 
We shall describe all our protocols in a simple manner, as an edge coloring scheme. Let $C = \{c_1, c_2, \ldots, c_N\}$ be 
the set of $N$ colors used in the scheme. Every edge in the network is assigned a subset of colors from the set $C$. 
The subset of colors assigned to the edge $e_{ij}$ will be denoted by $A_{ij}$. Each color in $A_{ij}$ represents a time instant 
during which the edge $e_{ij}$ is active.³ However, due to the broadcast nature of the medium, a node will experience interference if 
any other node connected to it, apart from its intended transmitter, is transmitting. A protocol 
which avoids this interference is said to be an interference-free protocol; such protocols will be of interest to us. Also, in 
the class of AF protocols that we consider, we assume that the source does not broadcast simultaneously to different 
nodes and that the sink does not listen to simultaneous transmissions by different nodes. We will see later that imposing 
such a constraint on the protocol is not restrictive, since we are able to achieve the best possible DMT performance 
with such a protocol. 

The upper bound on DMT for the class of KPP networks using the cutset bound ( Lemma [TTT1 ) is given by: 

$$d(r) \le K(1 - r).$$

Hence, for each of the KPP networks, we shall try to approach this bound. Since this bound corresponds to a 
MISO channel, we refer to it as the MISO bound. We shall prove, by constructing protocols and computing their 
DMT, that this bound can be achieved for all $K \ge 3$. 

B. Protocols achieving MISO bound 

In this section we propose protocols for the $K$-parallel path network and compute their DMT. For the case when 
$K \ge 3$, the DMT of the proposed protocols achieves the MISO bound. Also, for the case $K = 2$, we find the maximum 
multiplexing gain that a protocol can achieve among the class of AF protocols considered in this paper. 

Definition 8: A half duplex protocol is said to be an orthogonal protocol if at any node, at a given time instant, 
only one of the incoming or outgoing edges is active, and none of the nodes performs any processing of the symbols 
beyond forwarding the incoming packets. We put a further condition that an orthogonal protocol for a KPP network 
has all edges on a given parallel path activated an equal number of times. 

Remark 11: In the networking literature [29], a network is said to have orthogonal channels if interference is avoided 
at all nodes and each node can communicate with at most one other node at any given time. While Definition 8 
is similar to this, the notion of orthogonal protocols will be generalized to networks with interference as well in 
Section ED 

Proposition 1: Let $C = \{c_1, c_2, \ldots, c_N\}$ be the set of colors. An edge coloring is a map $\psi : E \to \mathcal{P}_C$ which 
takes $e_{ij}$ to $A_{ij}$. 

Every orthogonal protocol can be described as an edge coloring of the network satisfying the following constraints. 
Similarly, every edge coloring satisfying the following constraints describes an orthogonal protocol. 

$$A_{i1} \cap A_{j1} = \phi, \quad i \ne j. \qquad (66)$$
$$A_{i n_i} \cap A_{j n_j} = \phi, \quad i \ne j. \qquad (67)$$
$$A_{ij} \cap A_{i(j+1)} = \phi, \quad j = 1, 2, \ldots, n_i - 1. \qquad (68)$$
$$|A_{ij}| = m_i, \quad j = 1, 2, \ldots, n_i. \qquad (69)$$

Each color in $C$ represents a time slot and so the length of the cycle for the protocol is $N$. Each color in $A_{ij}$ 
represents a time slot during which the edge $e_{ij}$ is active. 

³ We assume that the network is in operation for a sufficient amount of time, so that if an edge is active, the node at the beginning of the edge 
always has a symbol to transmit. 

The first constraint corresponds to the fact that for an orthogonal protocol, only one outgoing edge is active at the 
source. Similarly the second constraint corresponds to the fact that for an orthogonal protocol, only one incoming 
edge is active at the sink. The third constraint captures the half duplex nature of the protocol. The last constraint 
indicates that all the edges in a given path are active for equal duration of time so that all the symbols transmitted 
by the source are forwarded to the sink. 

Definition 9: The rate, $R$, of an orthogonal protocol is defined as the ratio of the number of symbols transmitted 
by the source to the total number of time slots. In the notation above, we have 

$$R = \frac{\sum_{i=1}^{K} m_i}{N}$$

Definition 10: Consider a KPP network. Let $v_1, v_2, v_3, v_4$ be four consecutive vertices lying on one of the $K$ 
paths leading from source to sink. Let $v_1$ and $v_3$ transmit, thereby causing the edges $(v_1, v_2)$ and $(v_3, v_4)$ to be 
active. Due to the broadcast and interference constraints, the transmission from $v_3$ interferes with the reception at $v_2$. 
This is termed back-flow, and is illustrated in Fig. 15. 




Fig. 15. Back-flow on a path 



Back-flow can be avoided if we make sure that there are at least two inactive edges between any two active edges. 
We formalize this in the following remark: 

Remark 12: An orthogonal protocol avoids back-flow if the corresponding coloring satisfies the following condition: 

$$A_{ij} \cap A_{i(j+2)} = \phi, \quad j = 1, 2, \ldots, n_i - 2.$$

By Remark 12, it is evident that any three adjacent edges $e_{ij}$, $e_{i(j+1)}$, and $e_{i(j+2)}$ will map to disjoint sets of 
colors when the coloring scheme corresponds to an orthogonal protocol avoiding back-flow. Moreover, it remains 
consistent with the constraints to repeat the same set of colors on every third edge. This suggests an easy way of 
describing the edge coloring. For a given path in the network, we will have three sets of colors in order, and they 
are cyclically associated with the edges from source to sink. For reasons that will become apparent later, the last 
edge (the edge connected to the sink) in the given path may get associated with a different set of colors. So, to describe 
an orthogonal protocol, we define a tuple of sets $G_i = [G_{i0}, G_{i1}, G_{i2}]$ and a set $F_i$ for all $i$ such that 

$$A_{ij} = \begin{cases} G_{i(j \bmod 3)}, & j \ne n_i \\ F_i, & j = n_i \end{cases} \qquad (70)$$

Hereafter, we will use $G_i$ and $F_i$ for $i = 1, 2, \ldots, K$ to completely describe an orthogonal protocol. Here, $G_i$ 
specifies the colors that are repeated cyclically on the edges of the path $P_i$, and $F_i$ specifies the color on the last 
edge $e_{i n_i}$ of path $P_i$. 

Lemma 5.1: Consider a KPP network. If an orthogonal protocol satisfies the following constraints: 

1) The rate of the protocol is equal to one. 

2) In every cycle, the sink receives an equal number of symbols from each one of the $K$ parallel paths. 

3) The protocol avoids back-flow. 

Then the protocol achieves the MISO bound,⁴ i.e., 

$$d(r) = K(1 - r)^+$$

Proof: The induced channel matrix for any orthogonal protocol for a $K$-parallel path network can be written in 
block-diagonal form. This is by virtue of the fact that we are dealing with $K$ parallel paths and, at any time 
instant, the sink receives a symbol from only one of the $K$ paths. Further, the input symbols can be reordered such 
that the matrices $H_i$ on the block diagonal contain the fading coefficients corresponding to the $i$-th path. 

So, the induced channel matrix $H$ between the source and sink, considering $mK$ time instants of transmission, 
can be written in terms of the channel matrices $H_i$, $i = 1, 2, \ldots, K$, where $H_i$ is the $m \times m$ channel matrix for 
path $P_i$: 

$$H = \begin{bmatrix} H_1 & & & \\ & H_2 & & \\ & & \ddots & \\ & & & H_K \end{bmatrix}$$

$$\Rightarrow \det(I + \rho H H^{\dagger}) = \prod_{i=1}^{K} \det(I + \rho H_i H_i^{\dagger}) \qquad (72)$$

For protocols which avoid back-flow and use all paths equally, the channel matrix for path $P_i$ is given by 

$$H_i = g_i I_m, \quad i = 1, 2, \ldots, K, \qquad (73)$$

where $g_i = \prod_{j=1}^{n_i} g_{ij}$. 

Consider one cooperation frame of the protocol satisfying the above constraints. Let $x_i$ be the column vector of 
$m$ symbols transmitted by the source to path $P_i$ and $y_i$ be the column vector of $m$ symbols received by the sink 
from the path $P_i$, $1 \le i \le K$. Since $x_i$ passes through all the edges $e_{ij}$, $1 \le j \le n_i$, before reaching the sink, the 
channel model for one cooperation frame can be written as 

$$\begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_K \end{bmatrix} = \begin{bmatrix} g_1 I_m & & & \\ & g_2 I_m & & \\ & & \ddots & \\ & & & g_K I_m \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_K \end{bmatrix} + n \qquad (74)$$

$$= Hx + n \qquad (75)$$

where $n$ is the equivalent colored noise seen at the sink and $H$ is the equivalent parallel channel. It can be easily 
shown that the noise becomes white in the scale of interest [9]. The DMT of the above channel, $H$, can be shown 
to be 

$$d(r) = K(1 - r)^+,$$

which is the MISO bound. Here, the notation $(1 - r)^+$ indicates that we must choose the maximum of $0$ and $1 - r$. 

■ 

Corollary 5.2: If any orthogonal protocol has a channel matrix $H$, with $H_i$ as the channel matrix for the path 
$P_i$, such that $\det(I_m + \rho H_i H_i^{\dagger}) = \det(I_m + \rho H'_i H'^{\dagger}_i)$, where $H'_i = g_i I_m$ for $i = 1, 2, \ldots, K$, then that protocol 
achieves the MISO bound. 

Proof: The DMT depends only upon $\det(I + \rho H H^{\dagger})$, which remains the same as that in (72). Therefore the 
DMT remains the same. ■ 

⁴ Throughout the paper, keeping in mind that the number $N$ of symbols transmitted can be made large, we ignore a rate-loss factor of 
$\frac{N}{N+D}$ arising from the presence of $D$ units of delay in the network. 

Theorem 5.3: When $K \ge 4$, there exists a protocol achieving the MISO bound for KPP networks. 

Proof: We now establish an orthogonal protocol for the case when $K \ge 4$. By Proposition 1, it is sufficient to 
establish a coloring of the edges. We will give the map $\psi$ explicitly for the given network by specifying $A_{ij}$. 

We will be using the set of colors $C = \{c_1, c_2, \ldots, c_K\}$. In the following, whenever we refer to color $c_i$, assume 
$c_0 = c_K$ and, for $i > K$, $c_i = c_{(i \bmod K)}$. 

We will specify the coloring scheme by giving a tuple of sets $G_i = [G_{i0}, G_{i1}, G_{i2}]$ and a set $F_i$ for all $i$: 

$$G_i = [\{c_i\}, \{c_{i+1}\}, \{c_{i+2}\}]$$

$$F_i = \{c_{i+3}\}$$

It is easy to verify that the scheme described satisfies all the constraints of Lemma 5.1 and will therefore achieve 
the MISO bound. ■ 
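The verification left to the reader can be mechanized. The sketch below builds the coloring of Theorem 5.3 for given path lengths and checks the constraints (66)-(69) together with the back-flow condition of Remark 12. It is a sketch under stated assumptions: colors are represented by the integers $1, \ldots, K$, the cyclic set on edge $j$ is taken to be $G_{i(j \bmod 3)}$ (an assumed reading of (70)), and all function names are hypothetical.

```python
def theorem_5_3_coloring(path_lengths):
    """Return A[(i, j)] for paths i = 1..K and edges j = 1..n_i."""
    K = len(path_lengths)

    def color(index):
        # c_0 = c_K and c_i = c_(i mod K) for i > K
        return (index - 1) % K + 1

    A = {}
    for i, n in enumerate(path_lengths, start=1):
        for j in range(1, n + 1):
            # F_i = {c_(i+3)} on the last edge, G_i cyclically elsewhere
            offset = 3 if j == n else j % 3
            A[(i, j)] = {color(i + offset)}
    return A


def is_orthogonal_without_backflow(A, path_lengths):
    K = len(path_lengths)
    for i in range(1, K + 1):
        n = path_lengths[i - 1]
        for k in range(i + 1, K + 1):
            nk = path_lengths[k - 1]
            if A[(i, 1)] & A[(k, 1)]:        # (66): one edge active at source
                return False
            if A[(i, n)] & A[(k, nk)]:       # (67): one edge active at sink
                return False
        for j in range(1, n):
            if A[(i, j)] & A[(i, j + 1)]:    # (68): half duplex
                return False
        for j in range(1, n - 1):
            if A[(i, j)] & A[(i, j + 2)]:    # Remark 12: no back-flow
                return False
        if len({len(A[(i, j)]) for j in range(1, n + 1)}) != 1:
            return False                     # (69): equal activation
    return True


def protocol_rate(A, path_lengths, num_colors):
    # Definition 9 with m_i = |A_i1|
    return sum(len(A[(i, 1)]) for i in range(1, len(path_lengths) + 1)) / num_colors


if __name__ == "__main__":
    lengths = [2, 3, 4, 5]                   # K = 4 hypothetical path lengths
    A = theorem_5_3_coloring(lengths)
    print(is_orthogonal_without_backflow(A, lengths), protocol_rate(A, lengths, 4))
```

Running the same construction with $K = 3$ and path lengths such as `[4, 4, 4]` fails the half-duplex check, because $c_{i+3}$ then coincides with $c_i$; this is consistent with $K = 3$ being treated separately.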



C. Back-flow does not impair the DMT 

Lemma 5.4: Consider a network running an orthogonal protocol which, in the absence of back-flow, creates a 
block-diagonal matrix as the transfer matrix between the input and the output. For such a network, the DMT when 
back-flow is present is lower bounded by the DMT in the absence of back-flow. 

Proof: The presence of back-flow creates entries in the strictly lower-triangular portion of the transfer matrix. 
Since the DMT of a lower triangular matrix is lower bounded by the DMT of the corresponding diagonal matrix 
(by Theorem 3.3), the system with back-flow yields a DMT that is at least as good as that of the system without back-flow. 

■ 

1 ) Back Flow does not alter DMT in the Single Antenna Case: Since we already have a lower bound on the 
DMT of the networks with back-flow, it is sufficient to get an upper bound on the DMT, which is the same as the 
lower bound. 

Lemma 5.5: Consider a KPP network running an orthogonal protocol with single antenna nodes, which in the 
absence of back-flow creates a diagonal matrix as the transfer matrix between the input and the output. For such 
a network, the DMT when back-flow is present, is the same as the DMT in the absence of back-flow. 
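Proof: If the network has back-flow, then the channel matrix would be 

$$H = \begin{bmatrix} h_1 & 0 & \cdots & & 0 \\ h_2 g_{21} & h_2 & 0 & \cdots & 0 \\ h_3 g_{31} & h_3 g_{32} & h_3 & \cdots & 0 \\ \vdots & & & \ddots & \vdots \\ h_n g_{n1} & h_n g_{n2} & \cdots & & h_n \end{bmatrix}$$

If the network did not have back-flow, then the channel matrix would be 

$$H_d = \begin{bmatrix} h_1 & & & \\ & h_2 & & \\ & & \ddots & \\ & & & h_n \end{bmatrix}$$

$(I + \rho H H^{\dagger})$ is a positive definite Hermitian matrix and, by invoking Theorem 16.8.2 of [33], we have that the 
determinant is upper bounded by the product of row-norms: 

$$\begin{aligned}
\det(I + \rho H H^{\dagger}) &\le (1 + \rho |h_1|^2)\,(1 + \rho |h_2|^2 + \rho |g_{21}|^2 |h_2|^2) \cdots \\
&\qquad (1 + \rho |h_n|^2 + \rho |g_{n(n-1)}|^2 |h_n|^2 + \cdots + \rho |g_{n1}|^2 |h_n|^2) \\
&= \prod_{i=1}^{n} \left( 1 + \rho |h_i|^2 \left( 1 + |g_{i(i-1)}|^2 + \cdots + |g_{i1}|^2 \right) \right) \\
&\doteq \prod_{i=1}^{n} (1 + \rho |h_i|^2) \\
&= \det(I + \rho H_d H_d^{\dagger}) \qquad (76)
\end{aligned}$$

The dot equivalence in (76) follows from equation © in the proof of Lemma 3.1. 
We already have from Lemma 5.4 

$$\det(I + \rho H H^{\dagger}) \ \ge\ \det(I + \rho H_d H_d^{\dagger})$$

Therefore we get 

$$\det(I + \rho H H^{\dagger}) \doteq \det(I + \rho H_d H_d^{\dagger}).$$

Therefore, the DMT with back-flow is the same as without back-flow. ■ 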
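The sandwich used in this proof can be checked numerically on a small example. The sketch below uses real-valued coefficients (an assumption made to keep the code dependency-free; the paper's coefficients are complex), a hypothetical $3 \times 3$ back-flow matrix, and a hand-written Gaussian-elimination determinant.

```python
import math


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    result = 1.0
    for c in range(n):
        pivot = max(range(c, n), key=lambda row: abs(a[row][c]))
        if a[pivot][c] == 0.0:
            return 0.0
        if pivot != c:
            a[c], a[pivot] = a[pivot], a[c]
            result = -result
        result *= a[c][c]
        for row in range(c + 1, n):
            factor = a[row][c] / a[c][c]
            for k in range(c, n):
                a[row][k] -= factor * a[c][k]
    return result


def backflow_matrix(h, g):
    """Lower-triangular H: row i holds h_i*g[i][j] for j < i and h_i at j = i."""
    n = len(h)
    return [[h[i] if j == i else (h[i] * g[i][j] if j < i else 0.0)
             for j in range(n)] for i in range(n)]


def identity_plus_gram(H, rho):
    """I + rho * H * H^T for a real square matrix H."""
    n = len(H)
    return [[(1.0 if i == j else 0.0)
             + rho * sum(H[i][k] * H[j][k] for k in range(n))
             for j in range(n)] for i in range(n)]


if __name__ == "__main__":
    h = [0.8, 1.3, 0.5]                 # hypothetical path coefficients
    g = [[], [0.6], [0.4, 0.9]]         # hypothetical back-flow gains
    rho = 1e6
    H = backflow_matrix(h, g)
    with_backflow = determinant(identity_plus_gram(H, rho))
    diagonal_only = math.prod(1.0 + rho * x * x for x in h)
    row_norm_bound = math.prod(
        1.0 + rho * h[i] ** 2 * (1.0 + sum(x * x for x in g[i]))
        for i in range(len(h)))
    print(diagonal_only <= with_backflow <= row_norm_bound)
    print(math.log(with_backflow) / math.log(rho),
          math.log(diagonal_only) / math.log(rho))
```

The two logarithmic exponents printed at the end differ only by a term of order $1 / \log \rho$, which is what the dot equality in (76) expresses.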
Theorem 5.6: When K = 3, there exists a protocol achieving MISO bound for KPP networks. 

Proof: By Proposition 1, it is sufficient to establish a coloring of the edges. We will give the map $\psi$ explicitly for 
the given network by specifying $A_{ij}\ \forall i, j$. Define 

$$a_i = \begin{cases} 1, & n_i \equiv 1 \bmod 3 \\ 0, & n_i \not\equiv 1 \bmod 3 \end{cases} \qquad (77)$$

Without loss of generality, we assume that the paths are ordered such that for the first $l$ paths $a_i = 1$, followed 
by the paths for which $a_i = 0$. We give a protocol for the various possibilities of $l$. 

• Case 1: ($l = 0$, $1$, or $3$) 

We will give a coloring scheme such that the corresponding protocol avoids back-flow, uses all paths equally, 
and achieves rate 1. By Lemma 5.1, this protocol will achieve the transmit diversity bound. 
We will specify the coloring scheme by giving the tuple of sets $G_i = [G_{i0}, G_{i1}, G_{i2}]$ for all $i$. $G_i$ is defined 
exactly as in the proof of Theorem 5.3. 

The set of colors used is $C = \{c_1, c_2, c_3\}$. In the following, whenever we refer to color $c_i$, assume $c_0 = c_3$ 
and, for $i > 3$, $c_i = c_{(i \bmod 3)}$. 

For $l = 0$, 

$$G_i = \begin{cases} [\{c_i\}, \{c_{i+2}\}, \{c_{i+1}\}], & n_i \equiv 0 \bmod 3 \\ [\{c_i\}, \{c_{i+1}\}, \{c_{i+2}\}], & n_i \equiv 2 \bmod 3 \end{cases}$$

For $l = 1$, 

$$G_1 = [\{c_1\}, \{c_2\}, \{c_3\}]$$

$$G_2 = \begin{cases} [\{c_2\}, \{c_1\}, \{c_3\}], & n_2 \equiv 0 \bmod 3 \\ [\{c_2\}, \{c_3\}, \{c_1\}], & n_2 \equiv 2 \bmod 3 \end{cases}$$

$$G_3 = \begin{cases} [\{c_3\}, \{c_1\}, \{c_2\}], & n_3 \equiv 0 \bmod 3 \\ [\{c_3\}, \{c_2\}, \{c_1\}], & n_3 \equiv 2 \bmod 3 \end{cases}$$

For $l = 3$, 

$$G_i = [\{c_i\}, \{c_{i+1}\}, \{c_{i+2}\}]$$
• Case 2: ($l = 2$) 

For $l = 2$, we shall now come up with a protocol such that only one node in the third path encounters back-flow. 
Then, we show that the DMT for this protocol is equal to the MISO bound. We describe the coloring 
scheme for the protocol as follows. 

$$G_1 = [\{c_1\}, \{c_2\}, \{c_3\}]$$
$$G_2 = [\{c_2\}, \{c_3\}, \{c_1\}]$$
$$G_3 = [\{c_3\}, \{c_1\}, \{c_2\}]$$

After this assignment, we make the following modifications to $A_{ij}$: 

$$A_{3(n_3)} = \{c_3\}$$

$$A_{3(n_3 - 1)} = \{c_1\}, \quad \text{if } n_3 \equiv 2 \bmod 3$$

One can check that this will lead to back-flow at only one node, say $R_{ij}$, in the third path, whose position 
will depend on whether $n_3 \equiv 0 \pmod 3$ or $n_3 \equiv 2 \pmod 3$. 

For the given protocol, there is no back-flow in paths $P_1$ and $P_2$, and therefore, 

$$H_i = g_i I_m \quad \text{for } i = 1, 2. \qquad (78)$$

For path $P_3$, the channel matrix is no longer diagonal because there is back-flow; rather, the matrix is lower 
triangular. But according to Lemma 5.5, 

$$\det(I_m + \rho H_3 H_3^{\dagger}) \doteq \det(I_m + \rho H'_3 H'^{\dagger}_3), \quad \text{where } H'_3 = g_3 I_m.$$

Therefore, the DMT of the proposed protocol is the same as in the case when $H_3$ is a diagonal matrix, and 
hence it achieves the MISO bound by Corollary 5.2. ■ 



Theorem 5.7: For $K = 2$ and $n_i > 1$, the maximum achievable rate for any orthogonal protocol is given by 

$$R_{\max} = \begin{cases} 1, & n_1 + n_2 \equiv 0 \bmod 2 \\ \dfrac{2n_2 - 1}{2n_2}, & n_1 + n_2 \equiv 1 \bmod 2 \end{cases} \qquad (79)$$

where $n_1 \le n_2$. 

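Proof: By Proposition 1, any orthogonal protocol corresponds to a coloring of the edges, described by the map $\psi$. 
For $K = 2$, we consider the network as a cycle with edges $l_1, l_2, \ldots, l_{n_1+n_2}$ with associated sets of colors 
$D_1, D_2, \ldots, D_{n_1+n_2}$ respectively. Here, 

$$l_j = \begin{cases} e_{1j}, & j \le n_1 \\ e_{2(n_2 + n_1 + 1 - j)}, & n_1 < j \le n_1 + n_2 \end{cases}$$

$$D_j = \begin{cases} A_{1j}, & j \le n_1 \\ A_{2(n_2 + n_1 + 1 - j)}, & n_1 < j \le n_1 + n_2 \end{cases}$$

with a single constraint, 

$$D_j \cap D_{(j+1) \bmod (n_1 + n_2)} = \phi. \qquad (80)$$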



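Now suppose we have a coloring scheme with $N$ colors. Then each color can be an element of the sets of colors 
corresponding to at most $\left\lfloor \frac{n_1 + n_2}{2} \right\rfloor$ edges. This is because, if a color were assigned to more edges, then the half duplex constraint 
would be violated. So we have 

$$\sum_{i=1}^{2} \sum_{j=1}^{n_i} |A_{ij}| \ \le\ \left\lfloor \frac{n_1 + n_2}{2} \right\rfloor N$$

$$\text{i.e.,} \quad n_1 m_1 + n_2 m_2 \ \le\ \left\lfloor \frac{n_1 + n_2}{2} \right\rfloor N \qquad (81)$$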



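Since $n_i \ge 2$ in each of the paths, the constraint (80) also implies that 

$$2 m_1 \le N \qquad (82)$$
$$2 m_2 \le N \qquad (83)$$

To find the maximum rate, we pose the maximization problem: 
Maximise $\left( \frac{m_1}{N} + \frac{m_2}{N} \right)$ subject to (81), (82), and (83). 
This is easily solved. When $n_1 + n_2$ is even, the maximum is attained at $\frac{m_1}{N} = \frac{m_2}{N} = \frac{1}{2}$. When $n_1 + n_2$ is odd, it is attained at 

$$\frac{m_1}{N} = \frac{1}{2}, \qquad \frac{m_2}{N} = \frac{n_2 - 1}{2 n_2}.$$

So the maximum rate of the protocol is given by 

$$R_{\max} = \begin{cases} 1, & n_1 + n_2 \equiv 0 \bmod 2 \\ \dfrac{2n_2 - 1}{2n_2}, & n_1 + n_2 \equiv 1 \bmod 2 \end{cases}$$

where $n_1 \le n_2$. ■ 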
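The optimization in this proof is small enough to check by exhaustive search. The sketch below enumerates integer pairs $(m_1, m_2)$ satisfying (81)-(83) and compares the best rate with the closed form of Theorem 5.7. Fixing the cycle length to $N = 2n_2$ is an assumption made so that the optimum of the relaxed problem is attained at integers; the function names are hypothetical.

```python
from fractions import Fraction


def max_orthogonal_rate(n1, n2):
    """Best (m1 + m2) / N over integers m1, m2 satisfying (81)-(83), N = 2*n2."""
    if not 2 <= n1 <= n2:
        raise ValueError("expected 2 <= n1 <= n2")
    N = 2 * n2
    budget = ((n1 + n2) // 2) * N               # right-hand side of (81)
    best = 0
    for m1 in range(N // 2 + 1):                # (82): 2*m1 <= N
        for m2 in range(N // 2 + 1):            # (83): 2*m2 <= N
            if n1 * m1 + n2 * m2 <= budget:     # (81)
                best = max(best, m1 + m2)
    return Fraction(best, N)


def closed_form_rate(n1, n2):
    """Maximum rate stated in Theorem 5.7."""
    if (n1 + n2) % 2 == 0:
        return Fraction(1)
    return Fraction(2 * n2 - 1, 2 * n2)


if __name__ == "__main__":
    for n1, n2 in [(2, 2), (2, 3), (3, 4), (2, 5)]:
        print((n1, n2), max_orthogonal_rate(n1, n2), closed_form_rate(n1, n2))
```

Exact rational arithmetic via `Fraction` avoids any floating-point ambiguity when comparing the two rates.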



Construction 5.8: This construction establishes an orthogonal protocol for $K = 2$ which achieves the maximum 
rate. By Proposition 1, it is sufficient to establish a coloring of the edges. We will give the map $\psi$ explicitly for the given 
network by specifying $A_{ij}\ \forall i, j$. 

We consider the network as a cycle with edges $l_1, l_2, \ldots, l_{n_1+n_2}$ with associated sets of colors $D_1, D_2, \ldots, D_{n_1+n_2}$ 
respectively, as in the proof of Theorem 5.7. Here, 

$$l_j = \begin{cases} e_{1j}, & j \le n_1 \\ e_{2(n_2 + n_1 + 1 - j)}, & n_1 < j \le n_1 + n_2 \end{cases}$$

$$D_j = \begin{cases} A_{1j}, & j \le n_1 \\ A_{2(n_2 + n_1 + 1 - j)}, & n_1 < j \le n_1 + n_2 \end{cases}$$

with a single constraint, $D_j \cap D_{(j+1) \bmod (n_1 + n_2)} = \phi$. 
Case 1: $(n_1 + n_2) \equiv 0 \bmod 2$ 

We will have $C = \{c_1, c_2\}$. Define $\psi$ to be such that 

$$D_j = \begin{cases} \{c_1\}, & j = 1, 3, \ldots, n_1 + n_2 - 1 \\ \{c_2\}, & j = 2, 4, \ldots, n_1 + n_2 \end{cases}$$


Case 2: $(n_1 + n_2) = 1 \bmod 2$. We have the set of colors $C = \{c_1, c_2, \ldots, c_N\}$, where $N = 2n_2$. We will add colors to $D_j$ using the following algorithm.

1) Step 1: $D_j \leftarrow \emptyset\ \forall j \in \{1, 2, \ldots, n_1+n_2\}$.

2) Step 2: Now we will add colors to each of the sets $D_j$ using the following algorithm. In the algorithm, whenever we refer to $D_j$ with $j > n_1+n_2$, we mean $D_j = D_{j \bmod (n_1+n_2)}$, and with $j = 0$, we mean $D_j = D_{n_1+n_2}$ (that is, indices are always reduced modulo $n_1+n_2$ into the range $1, \ldots, n_1+n_2$).

{
    $t \leftarrow 1$;
    For $k = 1$ to $n_2$ in steps of 1 :
    {
        For $i = 1$ to $n_1+n_2-2$ in steps of 2 :
        {
            $D_{1-k+i} \leftarrow D_{1-k+i} \cup \{c_t\}$;
        }
        $t \leftarrow t + 1$;
    }
    For $k = 1$ to $n_2$ in steps of 1 :
    {
        For $i = 1$ to $n_1+n_2-2$ in steps of 2 :
        {
            $D_{(n_1+n_2)-(k-1)-i} \leftarrow D_{(n_1+n_2)-(k-1)-i} \cup \{c_t\}$;
        }
        $t \leftarrow t + 1$;
    }
}
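The two loops of Case 2 can be exercised on small examples. The sketch below is illustrative only and rests on two assumptions that are not spelled out in the text: the first loop is read as updating $D_{1-k+i}$ (the mirror image of the second loop's index $(n_1+n_2)-(k-1)-i$), and every out-of-range index, including negative ones, is wrapped modulo $n_1+n_2$ into $1, \ldots, n_1+n_2$. The function name `color_two_path_cycle` is hypothetical.

```python
def color_two_path_cycle(n1, n2):
    """Colour sets D[1..n1+n2] for the odd case, under the assumptions above.

    Colours are represented by their indices t = 1, ..., 2*n2.
    """
    total = n1 + n2
    if not (n1 <= n2 and total % 2 == 1):
        raise ValueError("need n1 <= n2 and n1 + n2 odd")

    def wrap(j):
        # Reduce any integer index into the range 1..total.
        return (j - 1) % total + 1

    D = {j: set() for j in range(1, total + 1)}
    t = 1
    # First loop (assumed index 1 - k + i): pointer moving one way round the cycle.
    for k in range(1, n2 + 1):
        for i in range(1, total - 1, 2):      # i = 1, 3, ..., total - 2
            D[wrap(1 - k + i)].add(t)
        t += 1
    # Second loop: same pointer positions, opposite direction.
    for k in range(1, n2 + 1):
        for i in range(1, total - 1, 2):
            D[wrap(total - (k - 1) - i)].add(t)
        t += 1
    return D

if __name__ == "__main__":
    D = color_two_path_cycle(3, 4)
    for j in sorted(D):
        print(j, sorted(D[j]))
```

Under these assumptions, for $(n_1, n_2) = (3, 4)$ every edge of the shorter path receives $n_2 = 4$ colours, every edge of the longer path receives $n_2 - 1 = 3$, and no two cyclically adjacent edges share a colour.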

Remark 13: The orthogonal protocol shown in Construction 5.8 achieves the maximum rate given in Theorem 5.7. In Case 1, it is clear that the rate achieved is 1. In Case 2, the number of colors used is $2n_2$. In the first loop of the construction, out of the $n_2$ colors used, $n_2 - 1$ colors are added to either $D_{n_1}$ or $D_{n_1+1}$ and all the $n_2$ colors are added to either $D_1$ or $D_{n_1+n_2}$. In the second loop of the construction, out of the $n_2$ colors used, $n_2 - 1$ colors are added to either $D_1$ or $D_{n_1+n_2}$ and all the $n_2$ colors are added to either $D_{n_1}$ or $D_{n_1+1}$. So the rate of the protocol would be $\frac{2n_2-1}{2n_2}$.




Fig. 16. Protocol Illustration: $(n_1, n_2) = (3, 4)$ [contd...] Panels shown: (c) Time slot 3, (d) Time slot 4.

2) Geometric Interpretation: In this subsection, we interpret the protocol constructed by Construction 5.8 in a geometric manner. We assume $n_1 \le n_2$ as in the previous section. As explained earlier, at any given time instant a maximum of $\lfloor \frac{n_1+n_2}{2} \rfloor$ edges can be active. Now $n_1 + n_2$ is odd, and due to the half duplex constraint, only alternate edges can be active. This means that, if we consider the entire network at any time instant, every alternate edge will be colored except for one place, where there will be two consecutive edges that are not active. We will give the protocol by specifying at which two consecutive places the edges will not be active, at every time slot.

Consider the longer path and fix our pointer on the first edge $e_{21}$ of the longer path $P_2$. Start a cycle from this edge (consider the whole network as a cycle now), and activate alternate edges beginning from the next edge following the pointer in the clockwise direction, for the first time slot. This defines the set of edges which are active for the first time slot. Hereafter, a set of edges which are simultaneously active at a time slot will be referred to as the activation set for that time slot. Now, move the pointer to the next edge $e_{22}$ of the longer path $P_2$ and repeat the same procedure. Now the activation set for the second time slot is defined. Continue the procedure, moving the pointer to all of the edges $e_{2i}$, $i = 1, 2, \ldots, n_2$. Thus the activation sets for the first $n_2$ time slots of the protocol are specified. For the next $n_2$ time slots of the protocol, the same procedure is followed, except that an anti-clockwise cycle is used instead of a clockwise cycle.

Thus the cycle length of the protocol equals $2n_2$. By using this procedure, the edges in the shorter path $P_1$ always get activated every alternate time instant. So, each edge in the shorter path gets $n_2$ colors. On the other hand, the edges on the longer path $P_2$ also get activated alternately except that they give up their transmission opportunity twice during the whole duration of $2n_2$ time slots. So each edge in the longer path $P_2$ gets $n_2 - 1$ colors.

This is illustrated with an example, $(n_1, n_2) = (3, 4)$. In Fig. 16, activation sets for the first $n_2$ time slots of the protocol are defined. Here, we can observe that the pointer moves in the clockwise direction. In Fig. 17, activation sets for the next $n_2$ time slots of the protocol are defined. The pointer is moved in the clockwise direction in Fig. 16; in contrast, it is moved anti-clockwise in Fig. 17.

Fig. 17. Protocol Illustration: $(n_1, n_2) = (3, 4)$ [..contd.] Panels shown: (c) Time slot 7, (d) Time slot 8.

Theorem 5.9: For a 2-PP network, if the two path lengths are equal modulo 2, then the DMT achieved by the orthogonal protocol of Construction 5.8 is equal to the MISO bound, i.e., $d(r) = 2(1 - r)^+$.

Proof: The proof follows from Lemma 5.1, Lemma 5.5 and Theorem 5.7. ■
Theorem 5.10: For a KPP network, there exists an orthogonal protocol achieving the MISO bound as long as $K \ge 3$, or $K = 2$ and $n_1 = n_2 \bmod 2$.

Proof: Clear by combining Theorem 5.3, Theorem 5.6 and Theorem 5.9. ■

D. KPP Networks with Direct Link 

Theorem 5.11: For KPP(D) networks with half duplex relays, single antenna nodes and with a direct link, the MISO bound on DMT is achievable whenever there is an orthogonal protocol avoiding back-flow that achieves the MISO bound in the absence of the direct link.

Proof: By hypothesis, the given KPP network with half duplex relays and single antenna nodes achieves the optimal DMT in the absence of the direct link. We know by Theorem 5.3 that all KPP networks with $K \ge 3$ achieve the optimal DMT.

Consider any KPP network with K > 4. We have also established that there exists a protocol, P, with cycle 
length K, achieving optimal DMT, in which the source sends one symbol each through every path during one 
cycle. Now assume that a direct link is added between the source and the sink.

Define a protocol P' as P with a modification such that nodes preceding the sink do not forward the symbols, 
but buffer them. (Each node is assumed to have enough buffer length for this). The protocol P' is run for D time 
slots on the network till all the nodes preceding the sink have at least one symbol in their buffer. Now switch back 
to the protocol P. 
Up to and including $D$ time slots, the sink receives $D$ symbols through the direct link. After $D$ time slots, the sink receives one symbol through the direct link, and another through a relayed path. By the definition of the protocol, each symbol transmitted by the source reaches the sink node through the direct link, and through exactly one relayed path. Note that each symbol arrives at the sink through the direct link, and a relayed path with a delay characteristic of the path. This is the same setting as in Theorem 3.3 and we invoke the results from there.

Let the total time slots elapsed be $M = mK + D$ for some positive integer $m$. Then the lower bound for the DMT, $d(\cdot)$, is given by,

$$d(r) \ge d_H(r) + d_G\left(\frac{M}{mK}\, r\right)$$

where $d_H(r) = (1 - r)^+$ and $d_G(r) = K(1 - r)^+$.

As $m$ tends to infinity, the DMT lower bound coincides with the cut-set bound, and thus the optimal DMT is achieved.



VI. Half Duplex KPP(I) networks 

In this section, we consider KPP networks in the presence of interference links between paths, i.e., KPP(I) networks. There is no direct link in KPP(I) networks as per the definition. We prove that the MISO bound is achievable even in KPP(I) networks.

The basic idea here is to consider the backbone KPP network for the given KPP(I) network. An orthogonal protocol is designed for the backbone network. This protocol is run on the KPP(I) network. There are now interference terms in the transfer matrix. However, if the transfer matrix can be written as a lower triangular matrix with the $K$ product coefficients on the diagonal, then we can use Theorem 3.3 and prove that the MISO bound is achievable.

A. Interference does not impair DMT

Next, we consider the case of causal interference, which we define first. 

Definition 11: Consider a KPP(I) network with single antenna nodes. Let us operate the backbone KPP network using an orthogonal protocol which induces an AF protocol on the KPP(I) network. Let $H$ denote the channel matrix induced by the AF protocol in the KPP(I) network and $H_d$ denote the diagonal channel matrix induced by the orthogonal protocol in the backbone KPP network. If the protocol is such that

• $H$ is lower triangular,

• Diagonal entries of $H$ are the same as those of $H_d$,

then the KPP(I) network is said to admit causal interference under that protocol.

Now we prove a Lemma which asserts that the DMT of a KPP(I) network with causal interference is the same as that of the backbone KPP network under the same protocol.

Lemma 6.1: Consider a KPP(I) network with single antenna nodes, running on an AF protocol which admits causal interference. Let the induced channel matrix be $H$, and the diagonal part of $H$ be $H_d$. Then the DMT of $H$ is the same as that of $H_d$.

Proof: The presence of causal interference creates entries in the strictly lower-triangular portion of the transfer matrix. Since the DMT of a lower triangular matrix is lower bounded by the DMT of the corresponding diagonal matrix, by Theorem 3.3, $d_H(r) \ge d_{H_d}(r)$.

Now we shall prove that $d_H(r) \le d_{H_d}(r)$, which will complete the proof of the lemma. Since an orthogonal protocol is employed, all the entries in a row of the matrix $H$ will have a common term $h_i$ corresponding to the fading coefficient of the last link connecting to the sink. So with causal interference, the channel matrix would be

$$H = \begin{pmatrix} h_1 g_{11} & 0 & 0 & \cdots & 0 \\ h_2 g_{21} & h_2 g_{22} & 0 & \cdots & 0 \\ h_3 g_{31} & h_3 g_{32} & h_3 g_{33} & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ h_n g_{n1} & h_n g_{n2} & h_n g_{n3} & \cdots & h_n g_{nn} \end{pmatrix}$$

where every $g_{ij}$ is a polynomial function of Rayleigh fading coefficients. Since the interference is causal, if the network does not have interference links (i.e., in the backbone KPP network), the same protocol would yield a channel matrix,

$$H_d = \operatorname{diag}(h_1 g_{11},\ h_2 g_{22},\ h_3 g_{33},\ \ldots,\ h_n g_{nn}).$$

Let $H_l = \operatorname{diag}(h_1, h_2, h_3, \ldots, h_n)$.

$(I + \rho HH^\dagger)$ is a positive definite Hermitian matrix and by invoking Theorem 16.8.2 of [33], we have that the determinant is upper bounded by the product of row-norms:

$$\begin{aligned}
\det(I + \rho HH^\dagger) &\le (1 + \rho|h_1|^2|g_{11}|^2)(1 + \rho|h_2|^2|g_{22}|^2 + \rho|g_{21}|^2|h_2|^2) \cdots \\
&\qquad (1 + \rho|h_n|^2|g_{nn}|^2 + \rho|g_{n(n-1)}|^2|h_n|^2 + \cdots + \rho|g_{n1}|^2|h_n|^2) \\
&= \prod_{i=1}^{n} \left(1 + \rho|h_i|^2 \left(|g_{ii}|^2 + |g_{i(i-1)}|^2 + \cdots + |g_{i1}|^2\right)\right) \\
&\doteq \prod_{i=1}^{n} (1 + \rho|h_i|^2) \\
&= \det(I + \rho H_l H_l^\dagger)
\end{aligned} \tag{85}$$

The dot equivalence (85) follows from equation (8) in the proof of Lemma 3.1.

$$\begin{aligned}
\text{Now, } \det(I + \rho HH^\dagger) &\mathrel{\dot{\le}} \det(I + \rho H_l H_l^\dagger) \\
&\doteq \det(I + \rho H_d H_d^\dagger) \\
\Rightarrow d_H(r) &\le d_{H_d}(r)
\end{aligned} \tag{86}$$

Equation (86) follows from the fact that the product of absolute values of Rayleigh random variables is equivalent to a single Rayleigh random variable in the scale of interest, as long as all the variables involved in the two matrices $H_l$ and $H_d$ are independent. ■
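The row-norm product bound used in this proof is an instance of Hadamard's inequality for positive definite matrices (the determinant is at most the product of the diagonal entries). The sketch below checks it numerically for real-valued lower triangular matrices; it is only a sanity check under that simplification (real entries in place of complex fading coefficients), and the helper names `det` and `det_and_row_norm_bound` are hypothetical.

```python
import random

def det(matrix):
    """Determinant via Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    d = 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        if abs(a[p][c]) < 1e-300:
            return 0.0
        if p != c:
            a[c], a[p] = a[p], a[c]
            d = -d
        d *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= f * a[c][k]
    return d

def det_and_row_norm_bound(H, rho):
    """Return (det(I + rho*H*H^T), prod_i (1 + rho*||row_i||^2)) for real H."""
    n = len(H)
    G = [[(1.0 if i == j else 0.0)
          + rho * sum(H[i][k] * H[j][k] for k in range(n))
          for j in range(n)] for i in range(n)]
    bound = 1.0
    for row in H:
        bound *= 1.0 + rho * sum(x * x for x in row)
    return det(G), bound

if __name__ == "__main__":
    random.seed(1)
    # Random lower triangular matrix, mimicking causal interference.
    H = [[random.gauss(0, 1) if j <= i else 0.0 for j in range(4)]
         for i in range(4)]
    print(det_and_row_norm_bound(H, rho=10.0))
```

For the $2 \times 2$ matrix with rows $(1, 0)$ and $(2, 3)$ and $\rho = 1$, the determinant is 24 while the row-norm product is $2 \cdot 14 = 28$.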



B. Causal Interference 

By Lemma 6.1, it is clear that the cut-set bound for a KPP(I) network can be attained if there is a protocol that yields a lower triangular matrix with $K$ independent coefficients along its diagonal repeated periodically (except maybe the first $D$ time instants). Specifically, if the input-output relation can be written in the following form, then a DMT of $d(r) = K(1 - r)^+$ is achievable.

$$y = \begin{pmatrix} g_1 & & & \\ * & g_2 & & \\ \vdots & & \ddots & \\ * & * & \cdots & g_K \end{pmatrix} x + n \tag{87}$$

where $g_i$ is the product of the fading coefficients $g_{ij}$ along path $P_i$ and $*$ denotes any entry, either zero or non-zero.

This would be our aim in the rest of the section: to establish when it is possible to find a protocol yielding such a channel matrix. Let us first consider the KPP network without interference, running on an orthogonal protocol. In this case, due to the different delays on the different paths, an input-output relation like Equation (87) does not hold immediately. In order to obtain one, we first consider a permutation of the input for which it is possible to do so.

We consider symbols received by the sink from the $n$th time instant onwards, with $n$ sufficiently large that the sink receives symbols from all the $K$ paths periodically. Consider the received symbols $y_{n+1}, y_{n+2}, \ldots, y_{n+K}$ in $K$ consecutive time instants, each of the symbols traversing a distinct path. Let the symbol received at time $n + i$ be $x_{m_i}$ and assume that the data comes through path $P_i$. Let us consider the transfer matrix between $y_{n+1}, y_{n+2}, \ldots, y_{n+K}$ and $x_{m_1}, x_{m_2}, \ldots, x_{m_K}$:

$$\begin{pmatrix} y_{n+1} \\ y_{n+2} \\ \vdots \\ y_{n+K} \end{pmatrix} = \begin{pmatrix} g_1 & & & \\ & g_2 & & \\ & & \ddots & \\ & & & g_K \end{pmatrix} \begin{pmatrix} x_{m_1} \\ x_{m_2} \\ \vdots \\ x_{m_K} \end{pmatrix} + n \tag{88}$$

Now consider any KPP(I) network built on the above backbone KPP network. We will give a sufficient condition on the interference so that the channel matrix has a structure like Equation (87).

Proposition 2: If the interference in a KPP network, running a particular protocol, has the following property: 
For each backbone path the following conditions are satisfied: 

• Condition 1: The delay experienced by data travelling on any other path from the first node of the backbone path should be no less than the delay on the backbone path from the first node to the sink.

• Condition 2: The unique shortest delay from the first node on the given path to the last node on that path is 
through the actual path from that node to the sink. 

Then the matrix connecting the output and a permuted version of the input will be lower triangular with K 
independent coefficients along its diagonal repeated periodically (except maybe the first D time instants). 

Proof: Consider the KPP network with interference. Reduce this to a network without interference, i.e., assume that relays in different paths are isolated from each other, and write the input-output transfer matrix as in Equation (88).

Let us consider a given symbol $x_{m_i}$ transmitted from the source. We are now looking for all possible ways in which this data can reach the sink, since these contribute to the entries other than the diagonal entries in the matrix that we are interested in. We want to get a lower triangular matrix with the $K$ product coefficients appearing on the diagonal.

A symbol $x_{m_i}$ from the source can get to the sink only after it has passed through the first node on the actual path on which it was intended to be sent if there were no interference. So we are interested in all possible path delays from the first node on the actual path to the sink.

If the data reaches the sink through all other paths later than it does on the backbone path, then the matrix is bound to be lower triangular. This is ensured by Condition 1. Now, we want the coefficients on the diagonal to be equal to $g_i$. This requires that there is no path splitting from the backbone path and merging back into it with the same delay as the actual path. Such a path would add another coefficient to $g_i$, which might create a problem. To ensure that this does not occur, we have Condition 2.

More formally, since the network satisfies Condition 1 of the proposition above, we have that, given that a symbol $x_{m_i}$ influences the output $y_{n+i}$ through the shortest path, the same symbol $x_{m_i}$ will not influence any $y_j$ for $j < n + i$. Since the network satisfies Condition 2 of the proposition above, we have that the symbol $x_{m_i}$ is coupled to $y_{n+i}$ through $g_i$, since there is no other coefficient that adds to this.

This means that in the representation given by Equation (88), the column corresponding to the input $x_{m_i}$ will look like: Column $i = [0\ \ldots\ 0\ \ g_i\ \ *\ \ldots\ *]^T$, where $*$ denotes some entry (zero or non-zero).
This clearly means that the matrix representation is lower triangular with $g_i$ on the diagonal repeating periodically, i.e., it is of the form (87) and therefore, by Theorem 3.3, the upper bound on DMT is achievable: $d(r) = K(1 - r)^+$.

■ 

Remark 14: The conditions in this proposition depend on the actual delays experienced by the data travelling 
through various paths. However, the actual delays depend on the protocol used. To simplify the criterion in terms 
of characteristics of network topology, we define a class of protocols with "almost continuous activation" in the 
next section. This modified criterion can be computed by a simple examination of the network. 

C. Protocols with Almost Continuous Activation 

In this section, we define a class of protocols with "almost continuous activation", wherein the conditions in Proposition 2 can be reduced to conditions on the path lengths of the network.

Definition 12: An orthogonal protocol for a KPP network is said to have continuous activation at a relay node 
if the node transmits whatever it receives from the incoming edge in the last instant in the immediately next time 
instant. 

Definition 13: An orthogonal protocol for a KPP network is said to have continuous activation if the protocol 
has continuous activation at all relay nodes. 

Definition 14: An orthogonal protocol for a KPP network is said to have almost continuous activation if the 
protocol has continuous activation at all relay nodes except possibly the first hop node on each parallel path. 

Protocols with almost continuous activation will be used in the future sections to establish a sufficient condition 
for achievability of DMT upper bound. Protocols with almost continuous activation have the property that the data 
passes continuously through the edges of the backbone paths of the KPP network in successive instants after the 
first hop. 

Theorem 6.2: For a KPP network without interference, there exists a protocol with almost continuous activation whenever $K \ge 3$.

Proof: Let us assume without loss of generality that the paths are ordered in ascending order of their sizes ordered modulo $K$. Let us consider a given path $P_i$. Let us fix the color on the first edge to be $c_i$, i.e., $A_{i1} = c_i$.

The next edge can be anything other than $c_i$ in order to satisfy the half duplex constraint. Once the color on the next edge is fixed, the colors on the rest of the edges are known because the protocol must have almost continuous activation. Let the next edge have color $c_m$, i.e., $A_{i2} = c_m$, and we know that $m \ne i$. So we must color the remaining edges consecutively: $A_{ij} = c_{m+j-2}$, $j \ge 2$.

We have $K - 1$ choices for $m$ and therefore these will lead to $K - 1$ different colors for the last edge $e_{i n_i}$. These are all possible colors $c_1, c_2, \ldots, c_K$ except the one color that will appear on the last edge if $A_{i2} = c_i$. Let us try to determine the one color that cannot appear on the last edge, because if it does, then the half duplex constraint will be violated.

Let $n_i = a \bmod K$. Then if $A_{i2} = c_i$, then $A_{i n_i} = c_{(i+a-2) \bmod K}$.

This means that if the starting color is $c_i$, then there are $K - 1$ colors allowed except the one stated here: $A_{i n_i} \ne c_{(i+n_i-2) \bmod K}$. Let $S_i = C \setminus \{c_{(i+n_i-2) \bmod K}\}$. Therefore, $S_i$ is the set of all allowed colors on the last edge in path $P_i$. We represent this symbolically by $c_i \leftrightarrow c_j, \forall c_j \in S_i$, where $\leftrightarrow$ denotes the terminal edge compatibility relation.

Now we have a set of starting colors $A = \{c_i, i = 1, 2, \ldots, K\}$. The set of ending colors (i.e., the colors on the ending edges) should also be the set $B = \{c_i, i = 1, 2, \ldots, K\}$ since we want a rate one protocol. Now visualize a bipartite graph $Q$ between the sets $A$ and $B$, where $c_i$ in $A$ is connected to $c_j$ in $B$ if $c_j \in S_i$.

Definition 15: A complete matching on this bipartite graph Q is a subgraph of Q where every node in A is 
connected to exactly one node in B and these nodes in B are distinct. 

Any complete matching on Q specifies a protocol with almost continuous activation and vice versa, since a 
protocol with almost continuous activation is specified by just the starting and the ending colors. From the theory 
of bipartite matching [31], we have the following proposition: 

Proposition 3: Let $Q$ be a bipartite graph from set $A$ to set $B$. Let $X \subseteq A$ be any subset of $A$. A complete matching from $A$ to $B$ exists iff

$$|\Gamma(X)| \ge |X|, \quad \forall X \subseteq A \tag{89}$$

where $\Gamma(X)$ denotes the set of all nodes that are adjacent to any node in $X$ on the graph $Q$.

Proposition 4: The bipartite graph $Q$ has a complete matching whenever $K \ge 3$.
Proof: The bipartite graph $Q$ has a complete matching iff $|\Gamma(X)| \ge |X|, \forall X \subseteq A$.

Since each element in $A$ is connected to $K - 1$ nodes in the set $B$, we have that $|\Gamma(X)| \ge K - 1, \forall X \subseteq A, X \ne \emptyset$. This means that the condition is satisfied automatically for the sets for which $0 < |X| \le K - 1$.

Now the only condition to check is when $|X| = K$. In this case the condition (89) reduces to

$$\bigcup_{i=1}^{K} S_i = C \tag{90}$$

This condition is violated $\iff$ all the $S_i$ are equal.
$\iff$ All the $c_{(i+n_i-2) \bmod K}$ are equal.
$\iff$ All the $(i + n_i - 2) \bmod K$ are equal.
$\iff$ All the $(i + n_i) \bmod K$ are equal to $M$ (say).
$\Rightarrow$ All the $n_i$ are distinct modulo $K$ and $(i + n_i) \bmod K$ are equal to $M$, for $i = 1, 2$.
Now since all the $n_i$ are distinct modulo $K$ and the paths are ordered in ascending order of their sizes ordered modulo $K$, we have $n_i = i - 1 \bmod K$.
$\Rightarrow$ All the $n_i$ are distinct modulo $K$ and $(1 + 0) \bmod K = (2 + 1) \bmod K$.
$\iff$ All the $n_i$ are distinct modulo $K$ and $0 = 2 \bmod K$.
$\Rightarrow K \le 2$.

Therefore, there is no complete matching on the bipartite graph $\Rightarrow K \le 2$. The contrapositive of this statement is that,

$K > 2 \Rightarrow$ There is a complete matching on the bipartite graph.

Therefore a complete matching exists whenever $K \ge 3$. This proves the proposition. ■

Since a protocol with almost continuous activation exists whenever a complete matching on the corresponding bipartite graph exists, we have that protocols with almost continuous activation exist whenever $K \ge 3$. Hence the theorem. ■
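The matching argument can be tried out on small networks. The sketch below is illustrative and makes its own modelling choices: colours are represented by residues modulo $K$ (with 0 standing for $c_K$), paths are sorted by length modulo $K$ as assumed in the proof, and the complete matching is found by brute force over permutations rather than by any method from the paper. The function names are hypothetical.

```python
from itertools import permutations

def forbidden_last_colour(i, n_i, K):
    """Residue of the colour that path i (1-based) may not use on its last edge."""
    return (i + n_i - 2) % K

def almost_continuous_matching(path_lengths):
    """Return (ordered lengths, last-edge colours) or (ordered lengths, None).

    The second item lists, for each path in the sorted order, the residue of
    the colour assigned to its last edge; all entries are distinct.
    """
    K = len(path_lengths)
    # Order paths by their length modulo K, as in the proof.
    ordered = sorted(path_lengths, key=lambda n: n % K)
    banned = [forbidden_last_colour(i, n, K)
              for i, n in enumerate(ordered, start=1)]
    # Brute-force search for a complete matching (fine for small K).
    for perm in permutations(range(K)):
        if all(perm[i] != banned[i] for i in range(K)):
            return ordered, list(perm)
    return ordered, None

if __name__ == "__main__":
    print(almost_continuous_matching([3, 2, 4]))   # K = 3
    print(almost_continuous_matching([2, 3]))      # K = 2, lengths of opposite parity
```

With $K = 2$ and path lengths of opposite parity, both paths are barred from the same colour, so no complete matching exists. This is consistent with the $K = 2$ restriction in Theorem 5.10.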

Now, we can translate the conditions on the delay in Proposition 2 into conditions on path lengths while using protocols with almost continuous activation. This is formalized in the following proposition:

Proposition 5: If the interference in a KPP network, running a protocol with almost continuous activation, has the following two properties, then the matrix connecting the output and a permuted version of the input will be lower triangular with $K$ independent coefficients along its diagonal repeated periodically (except maybe the first $D$ time instants). For each backbone path,

• Condition 1: The length of any other path from the first node should be no less than the delay on the backbone path from the first node to the sink.

• Condition 2: The unique shortest path from the first node on the given path to the last node on that path is through the backbone path from that node to the sink.

Proof: This follows directly from Proposition 2 and Remark 14. ■
1) Optimal DMT for regular networks: Now we show that the MISO bound is achievable for regular networks.
Theorem 6.3: The optimal DMT $d(r) = L(1 - r)^+$ of $(K, L)$ regular networks is achievable.

Proof: Consider a $(K, L)$ regular network. It can be treated as a KPP(I) network and therefore the backbone KPP network can be run using an orthogonal protocol with almost continuous activation. Consider the following protocol with almost continuous activation. Let the colors be $c_1, c_2, \ldots, c_K$, and assume $c_0 = c_K$ and $c_i = c_{i \bmod K}$.
$A_{ij} = c_{i+j-1}$, $i = 1, 2, \ldots, K$, $j = 1, 2, \ldots, L + 1$.

With this protocol it can be seen that interference is causal, i.e., interference satisfies the conditions of Prop. 2. Therefore, the optimal DMT of $L(1 - r)^+$ is achievable for these networks. ■

Corollary 6.4: For a (2,L) layered network, a lower triangular transfer matrix which contains the two product 
coefficients corresponding to the two parallel paths alternately on the diagonal can be obtained using the protocol 
with almost continuous activation. 

Corollary 6.5: For the two-hop relay network without direct link, the optimal DMT is achieved. 

Proof: The two-hop relay network without the direct link is a $(K, 1)$ regular network, where $K$ denotes the number of relays in the network. Thus Theorem 6.3 implies this corollary. ■

Remark 15: The result in Corollary 6.5 was also proved in an independent work [25]. The protocols used in this paper and in [25] are essentially the same as the SAF protocol [17], except that it is used in a network without a direct link. However, the proof techniques used here and in [25] are very different.
2) Optimal DMT for KPP(I) networks: In this section, we prove that the MISO bound can be achieved on all KPP(I) networks with $K \ge 3$.

In Prop. 2, we gave a sufficient condition to establish when a network can be used along with a given protocol in order to achieve the optimal DMT. Later, in Prop. 5, we gave a sufficient condition on path lengths in a network such that the network can be used along with a protocol with almost continuous activation to get the optimal DMT. Suppose the network does not meet the sufficient condition given in Prop. 5. It is possible that the protocol can be modified to make the network meet the sufficient condition of Prop. 2. We do so here by adding delays to internal nodes of the network such that, even though the path lengths do not satisfy the constraints, the delays do. By appropriately choosing a protocol and adding delays, we can make the network and the protocol jointly satisfy the conditions of Prop. 2. This leads us to the following Theorem:

Theorem 6.6: Consider a KPP(I) network with $K = 3$. There exists a set of delays which, when added appropriately to various nodes in the network and when used along with the protocol with almost continuous activation, satisfies the conditions of Prop. 2.

Proof: The proof is omitted here for brevity. The proof makes use of decomposing the given network into various layers, each of which can be balanced individually, and the layers can be put together to give a solution for the entire network. ■

Theorem 6.7: Consider a KPP(I) network with $K \ge 3$. The cut-set bound on the DMT $d(r) = K(1 - r)^+$ is achievable.

Proof: For $K = 3$, it follows from Theorem 6.6.
Now, we will consider the case when $K > 3$. Consider a 3 parallel path sub-network of the original network. By Theorem 6.6 we can get a matrix with these three product coefficients along the diagonal. There are now $\binom{K}{3}$ possible 3PP subnetworks. If each of these subnetworks is activated in succession, it would yield a lower triangular matrix with the $K$ product coefficients $g_i$ making up the $3\binom{K}{3}$ entries on the diagonal. By Theorem 3.3, the DMT of this matrix is no worse than that of the diagonal matrix alone. The diagonal matrix has a DMT equal to $K(1 - r)^+$. Therefore a DMT of $d(r) \ge K(1 - r)^+$ can be obtained. However, since $d(r) \le K(1 - r)^+$ by the cut-set bound, we have $d(r) = K(1 - r)^+$. ■



VII. Layered Networks 

Lemma 7.1: Let $\mathcal{H} \subseteq \{h_{11}, h_{12}, \ldots, h_{1M_1}\} \times \{h_{21}, h_{22}, \ldots, h_{2M_2}\} \times \cdots \times \{h_{K1}, h_{K2}, \ldots, h_{KM_K}\}$. Let $|\mathcal{H}| = N$. Let each $h_{ij}$ appear in $N_i$ of the terms in $\mathcal{H}$ irrespective of $j$. Then $N_i M_i = N$. Let $N_{\max} := \max_{i=1}^{K} N_i$ and $M_{\min} := \min_{i=1}^{K} M_i$.

Let $\mathbf{h}_i$, $i = 1, 2, \ldots, N$ be the elements of $\mathcal{H}$.

Let $\psi : \mathcal{H} \to G$ be a map such that $\psi((a_1, a_2, \ldots, a_K)) = \prod_{i=1}^{K} a_i$. Now let $g_i = \psi(\mathbf{h}_i)$, $i = 1, 2, \ldots, N$. Then each $g_i$ is of the form $\prod_{k=1}^{K} h_{k\,l(i,k)}$, where $l(\cdot, k)$ is a map from $[N] \to [M_k]$ for a fixed $k \in [K]$.
Let $H$ be an $N \times N$ diagonal matrix with the diagonal elements given by $H_{ii} = g_i$.

The DMT of the parallel channel $H$ is a linear DMT between a diversity of $\frac{N}{N_{\max}}$ and a multiplexing gain of $N$:

$$d(r) = \frac{(N - r)^+}{N_{\max}} \tag{91}$$

Proof:

Let us assume without loss of generality that $N_1 \ge N_2 \ge \cdots \ge N_K$.
$H = \operatorname{diag}(H_{ii})$, $H_{ii} = \prod_{k=1}^{K} h_{k\,l(i,k)}$.

Consider a variable transformation where $\alpha_{kj}$ is defined such that $\rho^{-\alpha_{kj}} = |h_{kj}|^2$.
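Now the DMT $d(r)$ is given by the following defining equation:

$$\begin{aligned}
\rho^{-d(r)} &\doteq \Pr\{\log\det(I + \rho HH^\dagger) < r\log\rho\} \\
&= \Pr\{\det(I + \rho HH^\dagger) < \rho^r\} \\
&= \Pr\Big\{\prod_{i=1}^{N}(1 + \rho|H_{ii}|^2) < \rho^r\Big\} \\
&= \Pr\Big\{\prod_{i=1}^{N}\Big(1 + \rho\prod_{k=1}^{K}|h_{k\,l(i,k)}|^2\Big) < \rho^r\Big\} \\
&= \Pr\Big\{\prod_{i=1}^{N}\Big(1 + \rho\prod_{k=1}^{K}\rho^{-\alpha_{k\,l(i,k)}}\Big) < \rho^r\Big\} \\
&= \Pr\Big\{\prod_{i=1}^{N}\Big(1 + \rho^{1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)}}\Big) < \rho^r\Big\} \\
&\doteq \Pr\Big\{\prod_{i=1}^{N}\rho^{(1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)})^+} < \rho^r\Big\} \\
&= \Pr\Big\{\sum_{i=1}^{N}\Big(1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)}\Big)^+ < r\Big\} \qquad (92) \\
&\le \Pr\Big\{\sum_{i=1}^{N}\Big(1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)}\Big) < r\Big\} \qquad (93) \\
&= \Pr\Big\{N - \sum_{k=1}^{K} N_k \sum_{j=1}^{M_k}\alpha_{kj} < r\Big\}
\end{aligned}$$

The last equality follows since each $|h_{ij}|^2$ appears in $N_i$ of the terms in $\mathcal{H}$ irrespective of $j$, and so do the corresponding $\alpha_{ij}$. Let $d_1(r)$ be defined as the SNR exponent of the RHS in the last equation above, i.e.,

$$\Pr\Big\{N - \sum_{k=1}^{K} N_k \sum_{j=1}^{M_k}\alpha_{kj} < r\Big\} \doteq \rho^{-d_1(r)} \tag{94}$$

Now,

$$d(r) \ge d_1(r) \tag{95}$$

$$d_1(r) = \inf_{\{N - \sum_{k=1}^{K} N_k \sum_{j=1}^{M_k}\alpha_{kj} < r,\ \alpha_{kj} \ge 0\}} \sum_{k=1}^{K}\sum_{j=1}^{M_k}\alpha_{kj} \tag{96}$$

Define

$$\alpha_k := \sum_{j=1}^{M_k}\alpha_{kj} \tag{97}$$

$$d_1(r) = \inf_{\{N - \sum_{k=1}^{K} N_k\alpha_k < r,\ \alpha_k \ge 0\}} \sum_{k=1}^{K}\alpha_k \tag{98}$$

$$\phantom{d_1(r)} = \inf_{\{\sum_{k=1}^{K} N_k\alpha_k > N - r,\ \alpha_k \ge 0\}} \sum_{k=1}^{K}\alpha_k \tag{99}$$

Claim: The infimum of $\sum_{k=1}^{K}\alpha_k$ under the constraint $\{\sum_{k=1}^{K} N_k\alpha_k \ge N - r,\ \alpha_k \ge 0\}$ is attained by $\alpha_1 = \frac{N-r}{N_1}$, $\alpha_i = 0, \forall i = 2, \ldots, K$, and the value of the infimum is $\frac{N-r}{N_1}$.
Proof: The proof is simple and is skipped here.
This claim implies that $d(r) \ge d_1(r) = \frac{N-r}{N_1}$.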
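The claim can be checked on a small instance by brute force. The following sketch is an illustration only: it compares the closed form $(N - r)^+/N_{\max}$ with a grid search over $(\alpha_1, \ldots, \alpha_K)$ subject to $\sum_k N_k \alpha_k \ge N - r$. The grid step, the search range and the function names are assumptions chosen for the example, not part of the paper.

```python
from fractions import Fraction
from itertools import product

def linear_dmt(N, N_counts, r):
    """Closed form (N - r)^+ / N_max from the claim."""
    return max(Fraction(N) - r, Fraction(0)) / max(N_counts)

def grid_infimum(N, N_counts, r, step=Fraction(1, 12), upper=3):
    """Smallest sum(alpha) on a grid, subject to sum(N_k * alpha_k) >= N - r."""
    points = [step * t for t in range(int(upper / step) + 1)]
    best = None
    for alphas in product(points, repeat=len(N_counts)):
        if sum(n * a for n, a in zip(N_counts, alphas)) >= N - r:
            total = sum(alphas)
            if best is None or total < best:
                best = total
    return best

if __name__ == "__main__":
    # Example with N = 6, (N_1, M_1) = (3, 2), (N_2, M_2) = (2, 3), r = 1.
    print(linear_dmt(6, [3, 2], Fraction(1)))
    print(grid_infimum(6, [3, 2], Fraction(1)))
```

In this example both computations give $5/3$, attained by putting all the weight on the coefficient with the largest $N_k$.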

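Now we will check that this lower bound is in fact equal to the DMT of the channel. Let us consider an assignment of $\alpha_{kj}$ suggested by the claim above: $\alpha_{1j} = \frac{N-r}{N_1 M_1} = \frac{N-r}{N}$, $j = 1, 2, \ldots, M_1$.

From (92), we know that

$$d(r) = \inf_{\{\sum_{i=1}^{N}(1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)})^+ < r,\ \alpha_{kj} \ge 0\}} \sum_{k=1}^{K}\sum_{j=1}^{M_k}\alpha_{kj} \tag{100}$$

We have to verify that this assignment yields the infimum under the constraint stated here.

Claim: The infimum of $\sum_{k=1}^{K}\sum_{j=1}^{M_k}\alpha_{kj}$ under the constraint $\{\sum_{i=1}^{N}(1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)})^+ \le r,\ \alpha_{kj} \ge 0\}$ is attained by $\alpha_{1j} = \frac{N-r}{N}$, $j = 1, 2, \ldots, M_1$, $\alpha_{kj} = 0, \forall k > 1$, and the value of the infimum is $\frac{N-r}{N_1}$.

Proof: Since the objective function is convex, a local minimum is the same as the global minimum. It is sufficient to prove that the stated point is a local minimum. To prove that, we show that the objective function does not decrease in a neighbourhood of the claimed optimal point. Let us assume that $\alpha_{kj} = \delta_{kj} \ge 0$, $k = 2, \ldots, K$.

Since $\alpha_{1j} = \frac{N-r}{N} < 1$, we have that all terms in the summation $\sum_{i=1}^{N}(1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)})^+$ are non-zero. By choosing $\delta_{kj}$ small enough, we can ensure that all terms in the summation are non-zero.

$$\begin{aligned}
\sum_{i=1}^{N}\Big(1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)}\Big)^+ &\le r \\
\Rightarrow \sum_{i=1}^{N}\Big(1 - \sum_{k=1}^{K}\alpha_{k\,l(i,k)}\Big) &\le r \\
\Rightarrow N - \sum_{k=1}^{K} N_k \sum_{j=1}^{M_k}\alpha_{kj} &\le r \\
\Rightarrow \sum_{k=1}^{K} N_k \sum_{j=1}^{M_k}\alpha_{kj} &\ge N - r \\
\Rightarrow N_1\sum_{j=1}^{M_1}\alpha_{1j} + \sum_{k=2}^{K} N_k \sum_{j=1}^{M_k}\delta_{kj} &\ge N - r \\
\Rightarrow N_1\sum_{k=1}^{K}\sum_{j=1}^{M_k}\alpha_{kj} &\ge N - r + \sum_{k=2}^{K}(N_1 - N_k)\sum_{j=1}^{M_k}\delta_{kj} \\
\Rightarrow \sum_{k=1}^{K}\sum_{j=1}^{M_k}\alpha_{kj} &\ge \frac{N-r}{N_1} + \sum_{k=2}^{K}\frac{N_1 - N_k}{N_1}\sum_{j=1}^{M_k}\delta_{kj} \\
\Rightarrow \sum_{k=1}^{K}\sum_{j=1}^{M_k}\alpha_{kj} &\ge \frac{N-r}{N_1}
\end{aligned}$$

The last equation follows since $N_1 - N_k \ge 0$, $k \ge 2$ and $\delta_{kj} \ge 0$.

Therefore $\alpha_{1j} = \frac{N-r}{N}$, $j = 1, 2, \ldots, M_1$, $\alpha_{kj} = 0, \forall k > 1$ is a local minimum, and thereby a global minimum. This yields a DMT of

$$d(r) = \sum_{k=1}^{K}\sum_{j=1}^{M_k}\alpha_{kj} = \sum_{j=1}^{M_1}\alpha_{1j} = \sum_{j=1}^{M_1}\frac{N-r}{N} = \frac{N-r}{N_1}$$

Thus $d(r) = d_1(r) = \frac{N-r}{N_1}$ is indeed the DMT of the channel described.

■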

Definition 16: Given a set of paths P in a layered network, the bipartite graph corresponding to the path set P 
is defined as follows: 

• Construct a bi-partite graph with vertices P on the left and vertices P again on the right. 

• Connect an element Pi on the left to Pj on the right if the two paths are node disjoint. 
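Definition 16 can be made concrete with a short sketch. It is illustrative only and makes its own representation choices: each path is given as a tuple of relay names (source and sink are left out, since every path shares them), two paths are joined when they share no relay, and a complete matching is searched for with a standard augmenting-path routine. The function names and relay labels are hypothetical.

```python
def node_disjoint(p, q):
    """True if the two paths (tuples of relay names) share no relay."""
    return not set(p) & set(q)

def complete_matching(paths):
    """Return partner[i] (index of the right-hand path matched to left path i),
    or None if the bipartite graph of Definition 16 has no complete matching."""
    n = len(paths)
    adj = [[j for j in range(n) if node_disjoint(paths[i], paths[j])]
           for i in range(n)]
    match_right = [None] * n   # match_right[j] = left index matched to right j

    def augment(i, seen):
        # Try to match left vertex i, re-routing earlier matches if needed.
        for j in adj[i]:
            if j in seen:
                continue
            seen.add(j)
            if match_right[j] is None or augment(match_right[j], seen):
                match_right[j] = i
                return True
        return False

    for i in range(n):
        if not augment(i, set()):
            return None
    partner = [None] * n
    for j, i in enumerate(match_right):
        partner[i] = j
    return partner

if __name__ == "__main__":
    # Two relaying layers with two relays each (relays a1, a2 and b1, b2).
    paths = [("a1", "b1"), ("a1", "b2"), ("a2", "b1"), ("a2", "b2")]
    print(complete_matching(paths))
```

In this example each path has exactly one node-disjoint partner, so the matching pairs path 0 with path 3 and path 1 with path 2.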

Lemma 7.2: Consider a set of paths A := {ai,i = 1,2, ...,N} in a given layered network. Let the product of 
the fading coefficient on the i-th edge disjoint path cij be <?j. Construct the bi-partite graph corresponding to A 
according to Definition. [16] If there exists a complete matching in this bi-partite graph, then these edges can be 
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activated in such a way that the DMT of this protocol is greater than or equal to the DMT of a parallel channel 
with fading coefficients gt,i = 1, 2, ...,N with the rate reduced by a factor of N, i.e., d(r) > dn d (Nr), where 
H d = diag(g 1 ,g 2 , ...,ffjv) 
Proof: 

Suppose there is a complete matching it on the graph constructed as above. The complete matching specifies for 
every edge disjoint path on the left <Zj, a partner on the right a Wi . The length of each path and therefore the delay 
is equal to D := L + 1. 

Step 1: Activate path $a_1$ along with path $a_{\pi_1}$ for a period $2T$, where $T > D$, treating these two paths as a 
2-PP network, since these two paths are node-disjoint. This network potentially has interference, but no direct 
link. Since this network is a subnetwork of a layered network, both paths of this 2-PP network have the 
same length and the interference is causal, and therefore rate 1 can be achieved on this network by Corollary 6.4. So the 
technique used in Section VI-C.1 can be used on this network to get a matrix with zeros in the first $D$ rows. 
After deleting these $D$ rows, the matrix is lower triangular due to causal interference, and the diagonal of the 
matrix comprises coefficients equal to $g_1$ and $g_{\pi_1}$ alternately, for $T - D$ durations each. After this is done, the 
various nodes in the network store the data that have not yet been passed to the sink. This data will be used in the 
future when this path is activated again. 

Step 2: Repeat Step 1 for all the paths $a_1, \ldots, a_N$. The net transfer matrix will comprise $ND$ zero rows, 
which effectively signifies a rate loss. 

On removing these zero rows we get a transfer matrix, $H$. The DMT of the protocol is $d(r) = d_H(2NTr)$. By 
using Theorem 3.3 we get that $d_H(r) \geq d_{H_1}(r)$, where $H_1$ is the diagonal matrix corresponding to the matrix $H$. 
But $H_1$ contains $2T - D$ entries each of $g_i$; therefore the DMT of this matrix is given by $d_{H_1}(r) = d_{H_d}\left(\frac{r}{2T - D}\right)$, where 
$H_d = \mathrm{diag}(g_1, \ldots, g_N)$. $\Rightarrow d(r) = d_H(2NTr) \geq d_{H_1}(2NTr) = d_{H_d}\left(\frac{2NT}{2T - D}\, r\right)$. 

For $T$ tending to infinity, we get $d(r) \geq d_{H_d}(Nr)$. 

■ 
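
The limit taken at the end of the proof can be illustrated numerically. The sketch below is only an illustration under the proof's own bookkeeping (each of the $N$ activations lasts $2T$ slots, of which $D$ rows are discarded); the function name `rate_scaling` is hypothetical.

```python
from fractions import Fraction


def rate_scaling(N: int, T: int, D: int) -> Fraction:
    """Factor c such that d(r) >= d_{H_d}(c * r) for finite T, i.e. c = 2NT / (2T - D)."""
    if T <= D:
        raise ValueError("the proof assumes T > D")
    return Fraction(2 * N * T, 2 * T - D)


# N = 4 paths, L = 2 relay layers, hence D = L + 1 = 3.
small_T = rate_scaling(4, 10, 3)
large_T = rate_scaling(4, 10**6, 3)
```

For small $T$ the factor exceeds $N$ noticeably, and it approaches $N$ as $T$ grows, which is the rate reduction stated in Lemma 7.2.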

Remark 16: This activation can also be done in a cyclic way in order to reduce the delay of data transfer. In the 
modified scheme, the method used above can be repeated for $L$ cycles. Now, instead of letting $T$ go to infinity, 
we can let $L$ tend to infinity to get the same DMT as above. 

A sufficient condition that guarantees a linear DMT between the maximum diversity and the maximum multiplexing gain 
on a general layered network is given in Lemma 7.3. 

Lemma 7.3: For a general layered network, a linear diversity-multiplexing tradeoff of $d(r) = d_{\max}(1 - r)^+$ 
between the maximum diversity gain $d_{\max}$ and the maximum multiplexing gain 1 is achievable whenever the 
bipartite graph corresponding to the set of edge-disjoint paths $e_i$, $i = 1, 2, \ldots, d_{\max}$, from the source to the sink has 
a complete matching. 
Proof: 

By using Lemma 7.2 we will be able to get a DMT of $d(r) = d_{H_d}(d_{\max} r)$. But since the paths are edge-disjoint, 
the fading coefficients are independent, and we get $d_{H_d}(r) = (d_{\max} - r)^+$. Therefore, we get $d(r) = (d_{\max} - d_{\max} r)^+ = d_{\max}(1 - r)^+$. ■ 

Definition 17: A path from a source to sink in a layered network is said to be forward-directed if all the edges 
in the path are directed from one layer to the next layer towards the sink (i.e., no edge in the path goes from one 
layer to the previous layer and there is no edge which starts and ends in the same layer.) 

Lemma 7.4: Let $P_1, \ldots, P_N$ be the set of all forward-directed paths in a fully connected layered network. Then 
the bipartite graph of the path set $P$ has a complete matching. 

Proof: We will prove this by producing an explicit complete matching on the bipartite graph. Let the layered 
network have $L$ layers. Let there be $R_i$ relays in the $i$-th layer. Let us fix an (arbitrary) ordering on the relays in 
each hop. Let the relays in the $j$-th hop be indexed $0, 1, \ldots, R_j - 1$. The number of paths is given to be equal to $N$. 

A forward-directed path $P_i$ is specified completely by the relays through which the path passes. This is denoted 
by the $L$-tuple $B_i = (b_{i1}, \ldots, b_{iL})$, where $b_{ij}$ denotes the index of the relay in the $j$-th hop through which path $P_i$ 
passes. Each $L$-tuple specifies a path from source to sink, since the layered network is fully connected. Now in 
this notation, two forward-directed paths $P_i$ and $P_j$ are node-disjoint if the tuples $B_i$ and $B_j$ are distinct in all the 
$L$ positions. 

Consider a map $\sigma : P \to P$, where 

$\sigma(P_i) = \sigma(B_i) = \sigma(b_{i1}, b_{i2}, \ldots, b_{iL}) = (b_{i1} + 1 \bmod R_1,\ b_{i2} + 1 \bmod R_2,\ \ldots,\ b_{iL} + 1 \bmod R_L)$. 

It can be checked that this map is a bijection from $P$ to $P$. Since $R_i > 1\ \forall i$, $B_i$ and $\sigma(B_i)$ are point-wise 
distinct, and thereby the paths $P_i$ and $\sigma(P_i)$ are node-disjoint. Therefore the map $\sigma$ defines a complete matching 
on the graph. ■ 
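
The explicit matching used in this proof is easy to check by brute force on small networks. The following minimal sketch assumes a path is represented by its tuple of relay indices, one per layer; the function names are hypothetical and the check is exhaustive, so it is meant only for small layer sizes.

```python
from itertools import product


def all_paths(R):
    """All forward-directed paths of a fully connected layered network.
    R[j] is the number of relays in layer j + 1; a path is a tuple of relay indices."""
    return list(product(*(range(r) for r in R)))


def shift(path, R):
    """The map from the proof: add 1 to every coordinate, modulo the layer size."""
    return tuple((b + 1) % r for b, r in zip(path, R))


def is_complete_matching(R):
    """Check that the shift map is a bijection and pairs each path with a node-disjoint one."""
    paths = all_paths(R)
    images = [shift(p, R) for p in paths]
    bijection = sorted(images) == sorted(paths)
    disjoint = all(b != c for p, q in zip(paths, images) for b, c in zip(p, q))
    return bijection and disjoint


ok_242 = is_complete_matching([4])       # single relay layer of the (2, 4, 2) example
ok_multi = is_complete_matching([2, 3, 2])
degenerate = is_complete_matching([1, 3])  # a layer with one relay violates R_i > 1
```

The degenerate case shows why the condition $R_i > 1$ is needed: with a single relay in some layer, the shifted path reuses that relay.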

Theorem 7.5: For a fully-connected layered network, a linear DMT between maximum diversity and maximum 
multiplexing gain of 1 is achievable. 

Proof: Consider a fully connected layered network with $L$ layers. Let there be $R_i$ relays in the $i$-th layer for 
$i = 0, 1, \ldots, L + 1$. Let $R_0 = R_{L+1} = 1$, since there is one source and one sink, and let $M_i := R_{i-1} R_i$, $i = 1, 2, \ldots, L + 1$, 
be the number of fading coefficients in the $i$-th hop. Let $h_{ij}$, $j = 1, 2, \ldots, M_i$, be the fading coefficients on the 
$i$-th hop for $i = 1, 2, \ldots, L + 1$. Let $N$ be the total number of forward-directed paths from source to sink, and 
$P_i$, $i \in [N]$, be the various forward-directed paths. Let $P$ denote the set of all these forward-directed paths. Then 
$|P| = N = \prod_{i=1}^{L} R_i$. Let $g_i$ be the product fading coefficient on path $P_i$. 

Let $M_{\min} = \min_{i = 1, \ldots, L+1} M_i$. Then $d_{\max} = M_{\min}$ by Theorem 4.1. 

By Lemma 7.4, the bipartite graph corresponding to $P$ has a complete matching. $P$ satisfies the criterion of 
Lemma 7.2 and therefore we can obtain a DMT of $d(r) \geq d_{H_d}(Nr)$. Now, we need to compute $d_{H_d}(r)$. To that 
effect, we make the following observations, which will enable us to utilize Lemma 7.1. 

A given path $P_i$ can be alternately represented as the set $G_i = (h_{1 l(i,1)}, h_{2 l(i,2)}, \ldots, h_{(L+1) l(i,L+1)})$ of fading 
coefficients on that path. Consider the set of all $G_i$, i.e., $G = \{G_i, i \in [N]\}$. 

Now let $g_k$, $k \in [N]$, be the product fading coefficient on path $G_k$. Now clearly 

$G \subset \{h_{11}, h_{12}, \ldots, h_{1 M_1}\} \times \{h_{21}, h_{22}, \ldots, h_{2 M_2}\} \times \cdots \times \{h_{(L+1)1}, h_{(L+1)2}, \ldots, h_{(L+1) M_{L+1}}\}$ 

Now each $h_{ij}$ appears in the same number $N_i$ of terms in $G$ irrespective of $j$, where $N_i = \frac{N}{M_i}$ and $N_{\max} = \max_i N_i = \frac{N}{M_{\min}}$. 

If $\psi$ is defined as in Lemma 7.1, then $g_i = \psi(G_i)$. Now we have satisfied all the conditions of Lemma 7.1 and 
therefore $d_{H_d}(r) = \frac{(N - r)^+}{N_{\max}}$. 
Now 

$d(r) \geq d_{H_d}(Nr) = \frac{(N - Nr)^+}{N_{\max}} = \frac{N}{N_{\max}} (1 - r)^+ = M_{\min} (1 - r)^+$ 

$\Rightarrow d(r) \geq d_{\max} (1 - r)^+$ ■ 
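
The counting argument in this proof (every fading coefficient of hop $i$ lies on $N / M_i$ paths, so that $N / N_{\max} = M_{\min}$) can be checked by enumeration. The sketch below is an illustration under the assumption that the network is fully connected and that the source and sink are single nodes; `edge_usage` is a hypothetical helper name.

```python
from collections import Counter
from itertools import product


def edge_usage(R):
    """R = (R_1, ..., R_L): relay counts of a fully connected layered network.
    Returns (N, M, usage) where N is the number of forward-directed paths,
    M[i - 1] is the number of edges in hop i, and usage[(hop, u, v)] counts
    the paths that use the edge from node u to node v in that hop."""
    layers = [1] + list(R) + [1]  # R_0 = R_{L+1} = 1 for source and sink
    M = [layers[i - 1] * layers[i] for i in range(1, len(layers))]
    usage = Counter()
    paths = list(product(*(range(n) for n in layers)))
    for p in paths:
        for hop in range(1, len(layers)):
            usage[(hop, p[hop - 1], p[hop])] += 1
    return len(paths), M, usage


N, M, usage = edge_usage((2, 4, 2))
N_max = max(usage.values())
```

For this example the ratio `N // N_max` coincides with `min(M)`, which is the slope $M_{\min}$ of the linear DMT obtained in the last step of the proof.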

For fully connected layered networks with L < 4, the min-cut is either at the source side or at the sink side, and 
hence we have the following corollary: 

Corollary 7.6: For a fully connected layered network with L < 4, the optimal DMT is achievable. 

Proof: Consider a layered network with $L = 1$, i.e., there is only one layer. Let there be $n_1$ relay antennas in 
the relaying layer. The DMT upper bound is $n_1 (1 - r)^+$ from the cut-set bound, which is achieved. 

Let $L = 2$ and let there be $n_1$ and $n_2$ relays in layers 1 and 2. Then the cut-set bound on the DMT is $\min\{n_1, n_2\} (1 - r)^+$, 
which is achieved. 

Let $L = 3$ and let there be $n_1, n_2, n_3$ relay antennas in the corresponding layers. Since $M_{\min} = \min\{n_1, n_1 n_2, n_2 n_3, n_3\}$, it can be seen that $d_{\max} = 
\min\{n_1, n_3\}$ and that the DMT upper bound is $\min\{n_1, n_3\} (1 - r)^+$, which is indeed achieved. ■ 

VIII. Networks with Multiple Antenna Nodes 

In this section we consider families of single source single sink networks with potentially all nodes having 
multiple antennas. We consider KPP networks with interference and Layered networks under both half duplex and 
full duplex constraint. 
Fig. 18. Comparison of various protocols for (2,4,2) network 

A. Achievable DMT for Certain Networks with Multiple antenna nodes 

1) Full Duplex Layered Networks: We consider layered networks with multiple antennas at the source and the 
sink. Multiple antennas at relays can be handled by replacing the relay with multiple single-antenna relays in the 
same layer. We do not assume directed antennas and consider undirected edges. However, this creates a back-flow, 
which induces a lower triangular matrix that we handle using Theorem 3.3. 

Definition 18: A single source single sink layered network with multiple antennas at the source and the sink is 
referred to as an $(n_0, n_1, \ldots, n_L, n_{L+1})$ network if the network has $L$ layers, with the source having $n_0$ antennas, 
the sink having $n_{L+1}$ antennas, and the $i$-th layer of relays having $n_i$ nodes with single antennas. 

In [16], parallel AF and flip-and-forward (FF) protocols have been proposed for the $(n_0, n_1, \ldots, n_{L+1})$ network 
with full duplex operation and directed antennas, so that back-flow is avoided. The parallel AF protocol aims to 
achieve the full diversity for the network, whereas FF achieves the extreme points of full multiplexing gain and the 
full diversity gain. In [16], it has been proved that FF achieves a better DMT than AF. However, the DMT curves 
of both these protocols lie far away from the cut-set DMT bound. We propose a protocol with achievable DMT 
better than the existing protocols for an $(n_0, n_1, \ldots, n_{L+1})$ network under the full-duplex constraint. 

In parallel AF and FF, the key idea is to partition the relay nodes in each layer into subsets of nodes called 
super nodes. A sequence of consecutive super nodes from source to sink form an AF path, and a set of AF paths 
is defined as a parallel partition in [16]. An independent parallel partition is defined as a parallel partition where 
any two different AF paths do not share common edges [16]. 

We propose a protocol which uses different partitioning depending upon the multiplexing gain $r$ (we will refer 
to $r$ as the rate by abuse of notation).$^5$ The basic intuition is that, at lower rates, we can exploit the diversity of 
the network by creating more parallel AF paths. At higher rates, super nodes are to be chosen such that each AF 
path has enough degrees of freedom. 

Let $P_i$ be the number of partitions in layer $i$. Let $P$ denote a particular partitioning, which is specified by the 
vector $(P_0, P_1, P_2, \ldots, P_{L+1})$, and let $\mathcal{P}$ denote the set of all possible partitionings. 

Given that layer $i$ has $P_i$ partitions, the number of independent AF paths is 

$N = \min_{i \in \{0, 1, 2, \ldots, L\}} P_i P_{i+1}$ 

The protocol is as follows: Activate all the $N$ parallel paths successively so that each path is activated for $T$ 
time instants. During the activation of the $i$-th path, we will get a transfer matrix that is block lower-triangular with $H_i$, 
the product matrix for the $i$-th path, on the diagonal. Since the matrix is lower triangular, the DMT of this matrix 
is better than the DMT of $H_i$. Let $d_i(r)$ be the DMT of this matrix, which can be computed using the techniques 
for computing the DMT of product Rayleigh matrices in [16]. Now the DMT of this induced channel can be given 
using Theorem 3.3 and the parallel channel DMT in Lemma 3.5: 

$d_H(r) \geq \sup_{P \in \mathcal{P}} \ \inf_{(r_1, r_2, \ldots, r_N):\ \sum_{i=1}^{N} r_i = r} \ \sum_{i=1}^{N} d_i(r_i)$ 

The DMT of the protocol can be given as $d(r) = d_H(Nr)$. 

$^5$ The idea of varying the protocol parameters depending on $r$ was used in [9] for the NSDF protocol. 
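
The count $N$ of independent AF paths defined above depends only on the partition vector. A minimal sketch of that count follows; the function name and the tuple representation of the partitioning are assumptions made for illustration.

```python
def num_independent_af_paths(partitions):
    """partitions = (P_0, P_1, ..., P_{L+1}): number of super nodes per layer,
    with the source and sink layers included. Returns N = min_i P_i * P_{i+1}."""
    if len(partitions) < 2:
        raise ValueError("need at least a source layer and a sink layer")
    return min(a * b for a, b in zip(partitions, partitions[1:]))


# Unpartitioned source and sink, one relay layer split into P super nodes.
n_two = num_independent_af_paths((1, 2, 1))
n_four = num_independent_af_paths((1, 4, 1))
n_mixed = num_independent_af_paths((1, 2, 3, 1))
```

With an unpartitioned source and sink and all relay layers split into the same number $P$ of super nodes, the minimum is attained at the source or sink hop and the count reduces to $P$.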

Since the optimization is over the set of all possible partitions, it might be difficult to compute the DMT in 
general. So we consider a restricted case in which the source and sink are unpartitioned, and all the relay layers 
are partitioned into the same number of partitions, $P$. Under this assumption, we have that $1 \leq P \leq n_{\min}$. Let $d_{(n_0, n_1, \ldots, n_{L+1})}(r)$ 
denote the DMT of a product channel $(n_0, n_1, \ldots, n_{L+1})$, which we can compute using the technique given in [16]. 
Let $n_i^P := \lfloor n_i / P \rfloor$, $i = 1, 2, \ldots, L$. When the relay layer $i$ is partitioned into $P$ partitions, each partition contains 
at least $n_i^P$ relays. If it contains more, the remaining relays are requested to be silent. This is done for simplicity 
of computing the DMT. 

The strategy of Theorem 4.2 can be used to obtain a DMT of $d_{\max}(1 - r)^+$ for a layered network (see 
Corollary 4.3). By combining this strategy with the aforementioned strategy and choosing the one with the better 
DMT based on $r$, we get a DMT of 

$d(r) \geq \max\left\{ d_{\max}(1 - r)^+,\ \sup_{P \in [n_{\min}]} P\, d_{(n_0, n_1^P, \ldots, n_L^P, n_{L+1})}(r) \right\}$ (101) 

The proposed protocol is essentially the same as [16] except for the following differences: 

• We consider an undirected graph, which gives rise to back-flow. We are able to handle back-flow by using 
Theorem 3.3. 

• We consider partitions of arbitrary size. Evaluating the DMT with arbitrary sized partitions is made possible 
because of the parallel channel DMT in Lemma 3.5. 

• The size of the partition is made variable with respect to the rate. 

• We will show that this result can be extended to half-duplex networks under the assumption that all partitions 
are of equal size with $P_i > 1$. 

• It can be shown that the DMT of the RHS in (101) is strictly better than that of the FF protocol.$^6$ 

Example 1: Consider a (2, 4, 2) layered network. The achievable DMT curves using the FF protocol and the proposed 
protocol, together with the cut-set bound, are plotted in Figure 18. 

2) Half-Duplex Layered Networks: We consider multi-antenna layered networks with the additional constraint 
of half-duplex relay nodes. We prove that the methods provided above for full-duplex networks can be generalized 
to the half-duplex network with bidirectional links. 

Consider the partitioning method stated for full-duplex layered networks, with $P_i = P$, $\forall i = 1, 2, \ldots, L$, i.e., the 
relaying layers are partitioned into an equal number of partitions. Let the source and sink be unpartitioned. When 
the relay layer $i$ is partitioned into $P$ partitions, each partition contains at least $n_i^P := \lfloor n_i / P \rfloor$ relays. If it contains 
more, the remaining relays are requested to be silent, as in the full-duplex case. 

The following observations are in place: Once we replace the nodes corresponding to the same partition by a 
super-node, this virtual network forms a regular network. This is because each relaying layer has the same number of 
partitions and therefore the same number of super-nodes. Therefore, this network can be treated as a KPP network 
with paths having equal lengths if $P > 1$. We use a protocol with continuous activation on this regular network. 
Since the paths are of equal length, the interference is causal, making the induced channel matrix lower triangular. 
This has a better DMT than the corresponding diagonal matrix by Theorem 3.3. This yields the same lower bound 
on the DMT as in the full-duplex case. Thus the DMT of the half-duplex network with this protocol is at least as good as that 
obtained by operating the network with a full-duplex protocol and the same partitioning. So we get: 

$d(r) \geq \max\left\{ d_{\max}(1 - r)^+,\ \sup_{P \in \{2, 3, \ldots, n_{\min}\}} P\, d_{(n_0, n_1^P, \ldots, n_L^P, n_{L+1})}(r) \right\}$ (102) 



$^6$ However, the fact that the FF protocol does not depend on $r$ can make practical implementation simpler. 
Example 2: For the case of the (2, 4, 2) network with the half-duplex constraint, the proposed protocol achieves the same 
DMT as the full-duplex case of Example 1. However, the FF protocol used naively for a half-duplex system will 
entail a multiplexing gain loss by a factor of $\frac{1}{2}$. 

B. KPP(I) Networks 

Consider KPP(I) networks with multiple antennas at the source and sink and potentially at all intermediate nodes. 

1) Full Duplex KPP(I) Networks: We consider full-duplex KPP(I) networks with multiple antenna nodes. Given 
an underlying path $P_i$, we activate all edges in $P_i$ simultaneously. Let us call this process activating the 
path $P_i$, and the fading matrix thus obtained $G_i$. So $G_i = \prod_{j} H_{ij}$, with the product taken over the edges of $P_i$. Let the DMT corresponding to this product 
matrix be $d_i(r)$, which depends only on the number of antennas on the path $P_i$ and can be computed according 
to the formulae given in [16]. 

Since activating different paths can potentially have different DMTs, it is not optimal in general to use all paths 
equally. 

When one is operating at a higher multiplexing gain, one might want to use a path with higher multiplexing 
gain more frequently in order to get a greater average rate. While operating at a low rate, all the paths must be used 
in order to get maximum diversity. We consider a generic case where path $i$ is activated for a fraction $f_i$ of the 
duration. These fractions can be chosen depending on $r$ in order to maximize $d(r)$. 

By so doing, we will get a parallel channel with repeated coefficients. The DMT of such a channel was evaluated 
in Lemma 3.8. The conversion however entails a loss factor, which is equal to the total number of time instants 
for which the channel was used. After making this rate correction, we get the following formula by modifying 
equation (55). So the achievable DMT is given by 

$d(r) \geq \sup_{(f_1, f_2, \ldots, f_K)} \ \inf_{(r_1, r_2, \ldots, r_K):\ \sum_{i=1}^{K} f_i r_i = r} \ \sum_{i=1}^{K} d_i(r_i)$ (103) 
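
To make the inner optimization in (103) concrete, the following sketch approximates the infimum by a grid search for $K = 2$ paths and fixed activation fractions. It is an illustration only: the per-path DMT $(1 - r)^+$ is an assumed stand-in for the $d_i(r)$ of [16], the grid is a coarse approximation of the infimum, and the outer supremum over the fractions is not carried out. All function names are hypothetical.

```python
from fractions import Fraction


def inner_inf_two_paths(d1, d2, f1, f2, r, steps=200):
    """Grid approximation of inf { d1(r1) + d2(r2) : f1*r1 + f2*r2 = r, r1, r2 >= 0 }."""
    best = None
    r1_max = Fraction(r) / f1  # largest r1 that keeps r2 non-negative
    for k in range(steps + 1):
        r1 = r1_max * k / steps
        r2 = (r - f1 * r1) / f2
        val = d1(r1) + d2(r2)
        best = val if best is None else min(best, val)
    return best


def single_antenna(r):
    """Assumed per-path DMT (1 - r)^+, used here purely for illustration."""
    return max(Fraction(0), 1 - r)


half = Fraction(1, 2)
d_at_half = inner_inf_two_paths(single_antenna, single_antenna, half, half, half)
d_at_zero = inner_inf_two_paths(single_antenna, single_antenna, half, half, Fraction(0))
```

With equal fractions and identical single-antenna paths, the sketch returns $K(1 - r)^+$ at the sampled rates, in line with the single-antenna KPP expression used later in the code design section.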

2) Half Duplex KPP(I) Networks: From Section VI we know that under the half-duplex constraint, there exists 
a protocol activating the $K$ paths equally for KPP(I) networks with $K \geq 3$ causing only causal interference. We 
can use the same protocol notwithstanding the fact that the relays contain multiple antennas. By doing so, we will 
get a transfer matrix which will be lower triangular. Also, the diagonal entries of this channel matrix would remain 
the same as though the relay nodes operate under full-duplex mode. By Theorem 3.3, this gives a lower bound on 
the DMT, and it is equal to the DMT lower bound of the full-duplex network in (103). Therefore, even when there is a 
half-duplex constraint, we can achieve the DMT given by (103) with $f_i = \frac{1}{K}$ instead of the supremum. 

If we want to achieve different fractions of activation for different parallel paths, then we can follow a different 
trick for $K \geq 4$. In this case, we can use the $\binom{K}{3}$ 3-parallel-path networks, but activate each 3-parallel-path network 
for a different fraction of time. Using this strategy, we can show that, for $K \geq 4$, all time fractions $f_i$ for the 
parallel paths $P_i$ can be obtained as long as $(f_1, f_2, \ldots, f_K) \in \mathcal{F}$, where 

$\mathcal{F} := \left\{ (f_1, f_2, \ldots, f_K) : \sum_{i=1}^{K} f_i = 1,\ 0 \leq f_i \leq \frac{1}{3} \right\}$ 

For $K \geq 4$, this yields a DMT of 

$d(r) \geq \sup_{(f_1, f_2, \ldots, f_K) \in \mathcal{F}} \ \inf_{(r_1, r_2, \ldots, r_K):\ \sum_{i=1}^{K} f_i r_i = r} \ \sum_{i=1}^{K} d_i(r_i)$ (104) 

This is the same as the lower bound on the DMT for the full-duplex case, except that we are constrained to have 
all activation fractions $f_i$ no greater than one-third. 
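
The time-sharing argument above can be illustrated as follows. The sketch assumes that each 3-parallel-path subnetwork activates its three paths equally, so a path receives one third of the time share of every subnetwork that contains it. It only checks that fractions produced this way fall in the constraint set; the converse claim (that every point of the set is reachable) is the statement of the text and is not verified here. Function names are hypothetical.

```python
from fractions import Fraction


def path_fractions(K, weights):
    """weights maps a 3-subset of range(K) (one 3-parallel-path subnetwork)
    to the fraction of time for which that subnetwork is activated."""
    if sum(weights.values()) != 1:
        raise ValueError("time shares must sum to 1")
    f = [Fraction(0)] * K
    for triple, w in weights.items():
        for i in triple:
            f[i] += Fraction(w) / 3  # equal activation inside each subnetwork
    return f


def in_constraint_set(f):
    """Membership test: fractions sum to 1 and each lies in [0, 1/3]."""
    return sum(f) == 1 and all(0 <= x <= Fraction(1, 3) for x in f)


# K = 4: share time equally between the subnetworks {0, 1, 2} and {0, 1, 3}.
weights = {(0, 1, 2): Fraction(1, 2), (0, 1, 3): Fraction(1, 2)}
f = path_fractions(4, weights)
```

In this example paths 0 and 1 are active one third of the time and paths 2 and 3 one sixth of the time, so unequal activation is obtained while every fraction stays at or below one third.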

IX. Code Design 

A. Design of DMT achieving codes 

Consider any network and protocol described above, and let us say the network is operated for $M$ slots. Let $L$ 
be the period of the protocol and let us assume $M = mL + D$ for simplicity. We will assume that after $D$ time 
instants the KPP network comes to steady state, and we will neglect the first $D$ time instants. Even though there 
is a rate loss of $\frac{D}{mL + D}$ associated with that, we can make this loss arbitrarily small by making $M$ large enough. 

The induced channel is given by $Y = HX + W$, where $X, Y, W$ are $M \times 1$ vectors and $H$ is an $M \times M$ matrix. 
However, to design an optimal code for this channel, we need to use a space-time code matrix $X$. In order to obtain 
an induced channel with $X$ being an $M \times T$ matrix, we do the following. Instead of transmitting a single symbol, 
each node transmits a row vector comprising $T$ symbols during each activation. Then the induced channel 
takes the form $Y = HX + W$, with $X, Y, W$ being $M \times T$ matrices and $H$ the same $M \times M$ matrix as earlier. 

So there are $MT$ symbols transmitted in total. In the matrix $X$, let us call the row vector of $T$ symbols in slot $i$ 
$\underline{x}_i$. To address a specific symbol, the $j$-th symbol in slot $i$, we use the notation $x_{ij}$. Let us use similar notation for 
the output: $y_{ij}$ denotes the $j$-th symbol received in the $i$-th time slot, and $\underline{y}_i$ denotes the row vector of $T$ symbols 
received in the $i$-th time slot. 

Now from [11], we know that if we use an approximately universal code for X, then it will achieve the optimal 
DMT of the channel matrix H irrespective of the statistics of the channel. Explicit minimal delay approximately 
universal codes for the case when T = M are given in [12], constructed based on appropriate cyclic division 
algebras [18]. These codes can be used here to achieve the optimal DMT of the induced channel matrix. 

1) Short DMT Optimal Code Design: The code construction provided above affords a code length of $TM = M^2$. 
Also, we need $M$ to be very large for the initial delay overhead to be minimal. This entails a very large block length, 
and indeed very high decoding complexity. Now a natural question is whether optimal DMT performance can be 
achieved with shorter block lengths. We answer this question for KPP networks by constructing DMT optimal 
codes that have $T = L$ and a block length of $L^2$, where $L$ is the period of the protocol used. We also provide a 
DMT optimal decoding strategy that requires decoding only an $L \times L$ matrix at a time. This is a constant which 
does not depend on $M$ and therefore, even if we make $M$ large, the delay and decoding complexity are unaffected. 
This code construction can be easily extended to other networks considered in this paper as well. 

After $D$ time instants, the KPP network attains steady state. Consider the first $L$ inputs after attaining steady 
state, $x_{D+1}, x_{D+2}, \ldots, x_{D+L}$. If the channel matrix is restricted to these $L$ time slots alone, then the channel matrix 
would be a lower triangular matrix with the independent coefficients $g_i$, $i = 1, 2, \ldots, K$, repeated periodically. The 
DMT of this matrix, after adjusting for rate, is $d_K(r) = K(1 - r)^+$. So if we use an $L \times L$ DMT optimal matrix as 
the input (this can be done by setting $T = L$ and using an $L \times L$ approximately universal CDA-based code for the 
input), we will be able to obtain a DMT of $d_K(r)$ for this subset of the data. This means that the probability of 
error for this vector comprising $T$ input symbols will be of exponential order $P_e \doteq \rho^{-d_K(r)}$ if an ML decoder 
is used to decode the $L \times L$ matrix. 

Let us assume that the first $L$ symbols have been decoded independently. Let us now focus on the next $L$ received 
symbols $y_{D+L+1}, y_{D+L+2}, \ldots, y_{D+2L}$. These symbols potentially depend on the previous block of $L$ symbols, and 
it is optimal to decode all of these together. However, we show that a Successive Interference Cancellation (SIC) 
based method is DMT optimal as well. After the first block of $L$ symbols is decoded, its effect will be subtracted 
out from the remaining symbols, and then the next block of $L$ symbols is decoded independently. For the third block, 
the effect of the first two blocks, each of length $L$, will be subtracted out and the third block decoded independently, 
and so on. 
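
The block-wise SIC procedure described above can be sketched in a simplified setting. The code below is an illustration under strong assumptions that are not part of the scheme in the text: one real symbol per slot (rather than a row of $T$ coded symbols), an uncoded BPSK alphabet instead of a CDA-based code, and exhaustive ML search within each block. The function name `sic_decode` is hypothetical.

```python
from itertools import product


def sic_decode(H, y, L, alphabet=(-1.0, 1.0)):
    """Block-wise successive interference cancellation for y = H x + w,
    with H lower triangular. Blocks of L symbols are decoded in order; each
    block is decoded by exhaustive ML search after the contribution of all
    previously decoded blocks has been subtracted."""
    M = len(y)
    if M % L != 0:
        raise ValueError("this sketch assumes M is a multiple of L")
    x_hat = []
    for start in range(0, M, L):
        # Remove the interference caused by blocks that are already decoded.
        resid = [
            y[i] - sum(H[i][j] * x_hat[j] for j in range(start))
            for i in range(start, start + L)
        ]

        def cost(cand):
            # Squared error of a candidate block against the cleaned observations.
            return sum(
                (resid[k] - sum(H[start + k][start + j] * cand[j] for j in range(k + 1))) ** 2
                for k in range(L)
            )

        x_hat.extend(min(product(alphabet, repeat=L), key=cost))
    return x_hat


# Noise-free toy example with M = 4 slots and blocks of length L = 2.
H = [
    [1.0, 0.0, 0.0, 0.0],
    [0.5, 0.8, 0.0, 0.0],
    [0.3, -0.2, 1.2, 0.0],
    [0.1, 0.4, -0.6, 0.9],
]
x = [1.0, -1.0, -1.0, 1.0]
y = [sum(H[i][j] * x[j] for j in range(4)) for i in range(4)]
x_hat = sic_decode(H, y, L=2)
```

The per-block search cost depends only on $L$ and the alphabet size, not on $M$, which reflects the complexity argument made for the short code design.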

Let us evaluate the probability of error when this SIC-based method is used. Let us find the probability of error 
for $B$ blocks after the initial $D$ instants of silence. Let $E_i$ denote the event that there is an error in any of the first 
$i$ blocks, and $F_i$ denote the event that there is an error in decoding the $i$-th block. Proceeding by induction on the $i$-th 
statement $P(E_i) \doteq \rho^{-d_K(r)}$, we get 

$P(F_i) = P(F_i \mid E_{i-1}) P(E_{i-1}) + P(F_i \mid \bar{E}_{i-1}) P(\bar{E}_{i-1})$ 

$\leq P(E_{i-1}) + P(F_i \mid \bar{E}_{i-1})$ 

$\doteq \rho^{-d_K(r)} + \rho^{-d_K(r)}$ 

$\doteq \rho^{-d_K(r)}$ 

$\Rightarrow P(E_i) = P\left( \bigcup_{j=1}^{i} F_j \right) \leq \sum_{j=1}^{i} P(F_j) \doteq \sum_{j=1}^{i} \rho^{-d_K(r)} \doteq \rho^{-d_K(r)}$ 

Therefore, the entire probability of error is of the exponential order of $\rho^{-d_K(r)}$, and the scheme 
achieves the optimal DMT of the $H$ matrix. 



B. Universal Full-Diversity Codes 

Consider an input-output equation of the form $Y = HX + W$, where $X, Y, H, W$ are $M \times M$ matrices. 

Usually the code design criterion given for an input matrix to have full diversity for Rayleigh fading is that the 
difference of any two possible input matrices be full rank. In this section we show that such a criterion is sufficient 
to get full diversity on any channel matrix distribution. By full diversity here, we mean that the code will attain a 
diversity equal to $d(0)$ for the channel. 

We quote the following theorem from the theory of approximately universal codes (Theorem 3.1 in [11]): 

Theorem 9.1: [11] A sequence of codes of rate $R(\rho) := r \log \rho$ bits/symbol is approximately universal over the 
MIMO channel if and only if, for every pair of codewords, 

$\lambda_1^2 \lambda_2^2 \cdots \lambda_{n_{\min}}^2 \geq \frac{1}{2^{R(\rho) + o(\log \rho)}} = \frac{1}{\rho^{r}\, 2^{o(\log \rho)}}$, (105) 

where $\lambda_1, \ldots, \lambda_{n_{\min}}$ are the smallest $n_{\min}$ singular values of the normalized (by $\frac{1}{\sqrt{\rho}}$) codeword difference matrix. A 
sequence of codes achieves the DMT of any channel matrix if and only if it is approximately universal. 

Substituting $r = 0$, corresponding to a multiplexing gain of 0, in Theorem 9.1, we get that the criterion is 

$\lambda_1^2 \lambda_2^2 \cdots \lambda_{n_{\min}}^2 \geq \frac{1}{2^{o(\log \rho)}}$. (106) 

In particular, if a code satisfies the condition that, for all pairs of codewords, the difference determinant is non-zero, i.e., 

$\lambda_1^2 \lambda_2^2 \cdots \lambda_{n_{\min}}^2 \geq L > 0$, (107) 

then the code is approximately universal for a rate of $r = 0$, and therefore achieves the $d(0)$ of any given channel 
matrix. 

This criterion is the same as the criterion for full diversity on a Rayleigh channel. This means that all codes with 
full diversity designed for the Rayleigh fading MIMO channel are indeed full diversity for a MIMO channel with 
any fading distribution. Therefore we can use a full-diversity code designed for a Rayleigh fading MIMO channel 
to get full diversity for any KPP or layered network, when used along with the corresponding protocol for these 
networks. 
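
The full-rank criterion discussed in this section can be checked mechanically for a small codebook. The sketch below is only an illustration: it uses the well-known $2 \times 2$ Alamouti code over QPSK as an assumed example of a code designed for the Rayleigh fading MIMO channel (it is not a construction from this paper), and tests whether every pairwise codeword difference has a non-zero determinant. Function names are hypothetical.

```python
from itertools import product


def alamouti(s1, s2):
    """2x2 Alamouti codeword for the symbol pair (s1, s2)."""
    return ((s1, -s2.conjugate()), (s2, s1.conjugate()))


def det2(A):
    """Determinant of a 2x2 matrix given as a tuple of rows."""
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]


def has_full_diversity(codebook, tol=1e-12):
    """True if the difference of every pair of distinct codewords is full rank."""
    for a in range(len(codebook)):
        for b in range(a + 1, len(codebook)):
            diff = tuple(
                tuple(x - y for x, y in zip(row_a, row_b))
                for row_a, row_b in zip(codebook[a], codebook[b])
            )
            if abs(det2(diff)) <= tol:
                return False
    return True


qpsk = [complex(re, im) for re, im in product((-1, 1), repeat=2)]
codebook = [alamouti(s1, s2) for s1, s2 in product(qpsk, repeat=2)]
full_div = has_full_diversity(codebook)

# A rank-deficient toy codebook: both codewords differ only in one entry.
rank_deficient = has_full_diversity([((1, 0), (0, 0)), ((-1, 0), (0, 0))])
```

For the Alamouti code the difference determinant equals the sum of the squared magnitudes of the symbol differences, so it is non-zero for any two distinct codewords, whereas the second toy codebook fails the test.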